





‘ 
4 
. 

. 

j 

: 

i 5 
' ‘ 
’ 
»* 
. 
. 
: 
: 
: , 
; , 
i 








Number 4, Part | 


Volume 49 OCTOBER 1954 





The Journal of Abnormal 


and 


Social Psychology 


Editor: J. McV. Hunt, University of Illinois 
Associate Editor: ARTHUR JENNESS, WILLIAMS COLLEGE 
Book Review Editor: STANLEY G. Estes, HARVARD UNIVERSITY 
Managing Editor: LORRAINE BOUTHILET 


CONSULTING EDITORS 
O. HoBart MowreErR 
University of Illinois 
LAWRENCE I. O’KELLY 
University of Illinois 
CHARLES E. Oscoop 
University of Illinois 
SAUL ROSENZWEIG 
Washington University 
HAROLD SEASHORE 
Psychological Corporation 
Davip SHAKOW 
University of Minnesota National Institute of Mental 
Cuar.es M. HarsH Health 
U.S. Navy Electronics Laboratory RicHarp L. SoLomon 
San Diego Harvard University 
Ross STAGNER 
University of Illinois 


GorpDon W. ALLPORT 
Harvard University 
HADLEY CANTRIL 
Princeton University 
LEE J. CRONBACH 
University of Illinois 
JouN DoLLARD 
Yale University 
LEONARD W. Doos 
Yale University 
LEON FESTINGER 


WriurM A. Hunt 
Northwestern University 
W. Ernst Kris IVAN STEINER 
Yale University School of Medicine —_ University of Illinois 


DonaLp W. MacKINNON 
University of California 

Quinn McNEMAR 
Stanford University 


ELIZABETH COLLINS WESMAN 
White Plains, N. Y. 
RosBert W. WHITE 
Harvard University 


CHARLES WRIGLEY 
University of Illinois 
PUBLISHED BY 


THE AMERICAN PSYCHOLOGICAL ASSOCIATION, Inc. 


1333 SIXTEENTH STREET N.W., WASHINGTON 6, D.C. 
anpD MT. ROYAL & GUILFORD AVES., BALTIMORE 2, MD. 


Entered as second-class matter February 15, 1954 at the Post Office at Baltimore, Maryland under 
the Act of February 28, 1925. Acceptance for mailing at special rate of postage pro- 
vided for in Section 34.40 (e), P. L. and R. Additional entry at 
Washington, D.C. 


Copyright 1954 by the American Psychological Association, Inc. 











The Journat is issued quarterly in January, April, July, and October. The yearly 
volume averages 544 pages. The subscription price per volume is $7.00; foreign, $7.50. 


Single issues are $2.00. 


Original manuscripts and editorial communications should be sent to: 


Dr. J. McV. Hunt 
Department of Psychology 
University of Illinois 
Urbana, Il. 


Books for review should be sent to: 
Dr. STANLEY G. EsTEs 
Psychological Clinic 
64 Plympton Street 
Cambridge 38, Mass. 


Fifty free offprints are supplied to contributors of articles. For book reviews, five 
copies of the JouRNAL are supplied gratis if requested. Authors of prior publication 


articles receive no gratis offprints or journals. 


Printed and mailed by: 
WAVERLY Press, INc. 
Baltimore 2, Maryland 


Subscriptions and business communications should be sent to: 


THE AMERICAN PsyCHOLOGICAL ASSOCIATION, INC. 
1333 Sixteenth Street N.W. 
Washington 6, D. C. 


Changes of address must reach the Subscription Office by the 25th of the month to 
take effect the following month. Undelivered copies resulting from address changes will 
not be replaced; subscribers should notify the post office that they will guarantee sec- 
ond-class forwarding postage. Other claims for undelivered copies must be made with- 


in four months of publication. 








PREDICTION OF FLYING SUCCESS BY CLINICAL ANALYSIS 
OF TEST PROTOCOLS! 


W. H. HOLTZMAN 


University of Texas 


AND S. B. SELLS 
USAF School of Aviation Medicine 


N RELATION to the problem of developing 
a battery of personality and motivational 
measures to predict adjustment to mili- 
tary and combat flying, the following questions 
were considered: (a) Can a group of expert 
clinical psychologists, given a description of the 
problem and a definition of the criterion, pre- 
dict this criterion on the basis of a battery of 
personality tests? (6) If such predictions can 
be made successfully, what are the effective 
predictor signs which the clinicians use, and 
how do they integrate these signs to obtain 
successful predictions? The experiment re- 
ported below was undertaken primarily for 
the purpose of developing hypotheses for the 
systematic, quantitative scoring of a group of 
experimental psychiatric screening tests for 
flying personnel. 

The six tests in this study were part of one 
experimental battery of group tests similar to 
those used by psychologists in clinical prac- 
tice. These tests are: (a) Background Infor- 
mation, a biographical history inventory; (0d) 
Ink-Blot Test, a group adaptation of the 
Rorschach test; (c) Feeling and Doing, a psy- 
chosomatic inventory reporting both symp- 
toms experienced and what the subject did 
about them; (d) What Is He Saying?—a sen- 
tence completion test designed for Air Force 
use; (e) L-D Test, a group test booklet of the 
Szondi test; and (f) Drawing Test, a group 
version of the Draw-A-Person test. These tests 
have been described in detail by Sells and 
others in a series of Air Force publications 
(1, 2, 4, 6, 7). 

A major problem in adapting these clinical 
testing procedures to selection use is that of 

1 The research reported here was conducted under 
terms of Contract No. AF 33(038)-13887 between the 
U. S. Air Force and The University of Texas in co- 
ordination with the Department of Clinical Psychology, 
USAF School of Aviation Medicine, Randolph Field, 
Texas. This research was discussed at a symposium en- 
titled “Clinical Prediction of Flying Success,”” American 
Psychological Association, Washington, D. C., Septem- 
ber, 1952. 
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developing adequate formal, quantitative scor- 
ing techniques that meet the requirements of 
psychometric standards and at the same time 
incorporate the interpretive schemes of the 
clinical method. This problem was attacked in 
a number of ways, for the most part by em- 
pirical analyses in which empirical scoring 
keys were developed by painstaking and sys- 
tematic comparison of the responses of dis- 
crete bipolar criterion groups and cross-vali- 
dated on independent samples.? The present 
experiment was undertaken as one part of the 
coordinated attack on this problem. 


METHOD 


After careful consideration of the test and criterion 
data, the probable availability of skilled clinicians to 
evaluate the test protocols, and the objectives of the 
research program, an experimental design for the 
clinician study was developed. From a total of 1,504 
test subjects (Ss) for whom complete criterion data 
were available, 100 aviation cadets were selected for 
the present study. Fifty of them had made a highly 
successful adjustment to the stress of flight training, 
and 50 had been eliminated from the program because 
they developed overt personality disturbances and were 
unable to complete the training course. The nature of 
the overt personality disturbances varied, but in every 
case the disturbance was sufficiently severe to be called 
to the attention of the flight surgeon, psychiatrist, and 
psychologist. The criteria used for selection of the 100 
cadets are: 


Successful Group 


1. Passed the flight training course 
2. Pilot aptitude of Stanine 6 or higher 





? Of the six tests used in this study, empirical studies 
have thus far developed valid scoring keys for three. 
Reports on the Rorschach (6) and Sentence Completion 
Test (7) have been published. Unpublished results on 
the Background Information inventory are reported in 
a general discussion of the psychiatric selection research 
program by Sells and Barry (5). Two studies of the 
Draw-A-Person test, one by Anastasi and Foley (1) 
and an unpublished study by Nurse and Barry, a series 
of analyses of the Szondi test by Ellis and Sells, and an 
extensive study of the Feeling and Doing test by Barry 
and Cobb have all yielded negative findings on cross- 
validation of experimental keys. 
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3. No evidence of personality or disciplinary dif- 
ficulty 

4. Leadership rating above the 75th percentile as 
judged by fellow cadets 

5. Satisfactory ratings by four independent instruc- 
tors on over-all suitability for role of Air Force 
pilot and officer 


Unsuccessful Group 


1. Elimination from the flight training program for 
reasons Other than physical disability 

2. Pilot aptitude of Stanine 6 or higher 

3. Evidence of overt personality disturbance con- 
tributory to elimination 


Since the number of cadets meeting these criteria 
was more than 50 in each group, 50 successful and 50 
unsuccessful cadets were selected at random from the 
available cases. Code numbers were assigned to these 
100 individuals, and five independent sets of 20 cadets 
each were drawn up randomly. In this manner, the a 
priori probability that a given cadet passed or failed 
was fixed at .50, with the number of successful cadets 
in a given set of 20 varying randomly about an ex- 
pected value of 10. Consequently, the success of a given 
clinician could be evaluated directly by use of the 
binomial distribution. 

Each test protocol was photographed, and sufficient 
photostatic copies were made to provide each par- 
ticipating psychologist with a complete set of test 
materials for his particular group of 20 cadets. 

Meanwhile a select list of psychologists, many of 
whom had served as military psychologists during 
World War II, was compiled; it consisted chiefly of 
recognized authorities in clinical procedures as applied 
to test interpretation and psychodiagnosis. As partial 
remuneration for the time-consuming clinical evalua- 
tion of 20 cadets, a small honorarium of $100 was 
offered each collaborator for his participation. The 19 
psychologists who accepted this invitation are: 

Lawrence Abt James Layman 
Arthur Benton Ivan Mensh 
Sidney Bijou Daniel Miller 
Gerald Blum Z. A. Piotrowski 
Charles Cofer Victor Raimy 
Molly Harrower Thomas Richards 
Max Hutt William Schofield 
Seymour Klebanoff Meyer Williams 
Bruno Klopfer Lee Winder 
Samuel Kutash 


TABLE 1 
EXPERIMENTAL DESIGN 


No. of Cadets 
Who Passed 


No. of 
Clinicians 


No. of 
Group Cadets 
20 
20 
20 
20 
20 
100 
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To provide uniform background information con- 
cerning the nature of the criterion and the flight-training 
program, three chapters from Sells’s (4) comprehensive 
outline of the research program which described the 
population and the case histories of unsuccessful 
cadets and outlined the critical requirements for suc- 
cess were included with the test materials. Detailed 
instructions for making and recording judgments, a 
brief summary of the over-all experimental! design, and 
a short description of each test were also enclosed. 


Two forms of recording sheet were used, one for re- 
cording judgments based on only one test at a time 
and one for recording global evaluations based on 
consideration of all or part of the available data for a 
given individual. Although each psycholegist was en- 
couraged to use first the single-test approach and then 
the global approach, the single-test analysis was made 
entirely optional. In addition to judging pass-fail for 
each cadet, each psychologist was asked to state, where 
possible, the cues which formed the basis for his evalua- 
tion. A three-point “confidence” scale was also pro- 
vided so that each clinician could state the degree of 
confidence he had in the validity of each judgment he 
made. A confidence rating of ““A’’ was to be given when 
the clinician felt quite sure of his judgment; a rating of 
“B” was to be used when he felt his judgment was 
questionable; and a rating of “C” denoted no con- 
fidence whatsoever in the judgment. 

Table 1 summarizes the essential features of the 
experimental design. Four of the five sets of test 
protocols were each judged independently by three 
different psychologists; the fifth set was evaluated by 
seven psychologists. Replication through judges made 
possible a study of the clinician as a variable and per- 
mitted the use of pooled judgments for a single case. 
The additional replication in the fifth group was de- 
signed to provide a more accurate estimate of inter- 
clinician variability than was possible with only three 
clinicians per cadet. 

The experimental design used in the present study 
permits several different approaches to testing the 
general hypothesis that adjustment to the training 
program could be predicted significantly better than 
chance by skilled clinicians from an examination of 
individual test protocols. Two sets of predictions were 
made by most of the psychologists. Every psychologist 
made a prediction of success or failure for each individ- 
ual in his group of 20 cadets. One class of predictions 
consists of these over-all or global evaluations. In addi- 
tion to using the global approach, 12 psychologists made 
predictions for each cadet based upon consideration of 
only one test at a time. A second ciass of predictions 
consists of these single-test evaluations. In either case, 
the data may be examined for one psychologist inde- 
pendently of the others, or combined predictions can 
be obtained by pooling the judgments of individual 
psychologists for each cadet. 


RESULTS 


Let us first examine the validity results of 
the global judgments. The number of correct 
judgments among the 19 psychologists ranged 
from 4 to 14 with a mean of 10.2. Since 20 





PREDICTING FLYING SuCCESS BY CLINICAL ANALYSIS 


TABLE 2 
Success OF OvER-ALL (GLOBAL) EVALUATIONS FOR 
INDIVIDUAL CLINICIANS* 


Correct 
Predic- 
tions 


Clinician 
Code 
No.t 


Correct 
Predic- || 
tions 


Clinician 
Code 
No.t 


Group Group 


11 12 
13 
12 
8 
8 
10 
9 
12 
10 
6 


* Chance expectation is 10 correct out of 20. Mean number of 
correct predictions per clinician is 10.2. 

+t The code numbers have no relation to the order of listing of 
the clinicians’ names. 


cadets were judged by each psychologist, and 
since the a priori probability that a single pre- 
diction is correct by chance alone is .50, the 
expected chance number of correct predictions 
for a single clinician is 10. The results of this 
analysis are presented in Table 2. If we disre- 
gard confidence ratings, none of the psycholo- 
gists was able to predict adjustment to train- 
ing significantly better than chance with the 
global approach to test evaluation.* 

A similar analysis was made using only the 
predictions with confidence ratings of A, which 
signified that the clinician was quite sure of 
his judgment. The number of over-all predic- 
tions that were rated A by a single psychologist 
varied from 0 to 12. The a priori chance ex- 
pectation for number correct is one-half the 
total number of cadets assigned an A confi- 
dence rating in each case. Only two psycholo- 
gists (L and S) were able to predict the actual 
adjustment of the cadets better than chance at 
the 5 per cent level of significance. These 
results, however, must be interpreted with 
great caution since seven of the psychologists 
had fewer correct predictions rated A than 
would be expected by chance alone. 

Similarly, when the results obtained for the 
B and C confidence ratings are considered 
alone, none of the predictions rises significantly 
above chance expectation. 

Since it seems quite reasonable that greater 


3 One psychologist (R) did make predictions which, 
when considered alone, were significantly worse than 
chance. 


TABLE 3 
Success OF PooLep GLOBAL EVALUATIONS 


Chance 
Expecta- 
tion of 
Correct 
Predic- 
tions 


Number of 
Correct 
Predictions 


Condition 


54 out of 100 
17 out of 35 


Majority agreement 
Majority agreement (A 
ratings only) 


Unanimous agreement 28 out of 50 


stability of predictions can be obtained by 
pooling judgments, the global evaluations 
made by each psychologist for a given cadet 
were combined. Two different methods of 
pooling judgments were used. In one case, 
whatever was predicted by a simple majority 
(2 out of 3, or 4 out of 7) of psychologists was 
accepted as the pooled judgment. A more 
severe criterion of agreement was obtained by 
considering only those cadets for whom there 
was unanimous agreement in Groups 1-4 and 
a majority agreement of 5 out of 7 psycholo- 
gists in Group 5. On an a priori basis, the ex- 
pected number of correct predictions by chance 
alone is one-half the total number of cadets for 
whom predictions are available. Inspection of 
Table 3 clearly reveals that there is little re- 
lationship between actual adjustment and 
predictions obtained by pooling global 
judgments. 

When judgments based upon the single-test 
approach are examined, the results are essen- 
tially the same. Twelve of the clinicians made 
judgments using the single test approach as 
well as the global approach. When confidence 
ratings are disregarded, only 3 of 68 different 
sets of predictions were sufficiently accurate to 
prove significant at the 5 per cent level. When 
only those predictions which the clinician rated 
A, indicating that he was quite sure of his 
judgment, are considered, the results are essen- 
tially the same. Only one set of predictions, 
that made by psychologist B using Back- 
ground Information, is significant at the 5 per 
cent level. Since a total of 68 significance tests 
was implicitly made, one for each set of pre- 
dictions, the apparent success of psychologist 
B may well be considered a chance fluctuation.‘ 


*A partial test of this hypothesis was made for 
psychologist C who did about as well as B on Back- 
ground Information. C offered to make predictions for 
two additional groups of 20 cadets each (Groups 2 and 
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As in the case of the global evaluations, an 
analysis was made of the results obtained for 
each separate test, using majority agreement 
of individual clinicians as a basis for predic- 
tion. The number of correct predictions did 
not differ significantly from chance expectation 
for any of the six tests. Clearly, there is little 
relationship between actual adjustment and 
predictions based upon an analysis of one test 
at a time 

DISCUSSION 

In the present study there is little doubt 
that the clinical assessments of beginning avia- 
tion cadets have no relationship to a criterion 
of adjustment in the basic flight-training pro- 
gram. The objective findings can hardly be 
ignored, and it is of utmost importance to con- 
sider the possible explanations of these results 
in order to appreciate fully their significance. 

1. Are the criterion data faulty? A major 
factor in the success or failure of any predic- 
tion experiment is the validity of the criterion 
measure. This immediately raises the question 
“Validity for what?” The present study is con- 
cerned with predictions of the adjustment to 
the stresses of flight training of two groups of 
aviation cadets carefully selected to reflect 
highly effective and highly inadequate patterns 
of behavior. The clinician is called upon to 
predict (in retrospect) the future behavior of 
each cadet on the basis of test performance 
obtained when he first entered training. In this 
sense, the evaluations called for are prognostic 
of later adjustment rather than diagnostic of 
present behavior. 

The likelihood that our criterion groups of 
successful and unsuccessful cadets are in error 
is small. Few, if any, experimental investiga- 
tions of adjustment to stress have been based 
upon criteria as objective and manifold as 
those used in the present study. Although it is 
true that some members of the successful group 
may later experience breakdown under high 





5), using only cues provided by Background Informa- 
tion. To aid him in improving his subsequent judg- 
ments he was given the criterion data for his original 
set of protocols (Group 1). C first made predictions for 
the 40 cadets without specific knowledge of his earlier 
successes and failures. Twenty-five of the 40 predictions 
later proved to be correct. Then C re-evaluated the 40 
test records after examination of the earlier data, and 
changed several predictions. Twenty-two of the 40 
predictions were correct. Since in both cases chance 
expectation was 20 correct, the null hypothesis could 
not be rejected. 
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situational stress and may manifest behavior 
indicative of poor adjustment, the anxiety, 
functional disorders, and overt personality 
disturbance which played a prominent role in 
the failure of each cadet in the unsuccessful 
group are outright evidence of maladjustment 
and preclude consideration of these cadets in 
any other category. 

One explanation of the inability of the psy- 
chologists to predict the criterion may be the 
relatively restricted range of adjustment. 
Since each cadet entering flight training had 
successfully passed a number of hurdles prior 
to admission, the likelihood of an individual 
with severe neurotic or psychotic manifesta- 
tions being included in the test population is 
extremely small. This objection has been 
partially overcome by selecting a small ex- 
perimental sample of 100 cases from a much 
larger group in such a way as to reflect only 
the extremes. This was deliberately done to 
maximize the likelihood of obtaining significant 
predictions. 

2. Is it possible that test evaluations were 
valid with respect to adjusiment at the lime the 
test data were gathered, and that later adjustment 
has little relationship to assessment of initial 
personality status? Because of the exhaustive 
mental, physical, and educational require- 
ments for aviation cadets, an overtly neurotic 
or disturbed candidate is rare. However, the 
problem of prediction is to discover latent or 
subclinical characteristics which constitute 
predisposition to later manifestation of ex- 
cessive stress reaction when the actual stresses 
of military flying are experienced. Hence, if 
the clinical assessments were concerned with 
manifest indications of disturbance at the time 
the tests were administered, it might be ex- 
pected that they would be unrelated to later 
adjustment, unless a very disturbed applicant 
were found. It is possible that this task is un- 
fair to the clinicians before more detailed 
empirical research data are available on the 
signs indicative of predisposition. However, at 
the time the research was undertaken it was 
hoped that the insights of the clinicians might 
facilitate the prosecution of the empirical stud- 
ies. The results of the empirical studies which 
have been completed, such as those with the 
Rorschach, sentence completion, and other 
tests, confirm the conclusion that the items 
found most discriminating are frequently not 
those which were expected on a priori grounds. 
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3. Were the judges sufficiently familiar with 
the prediction problem to evaluate the test pro- 
tocols in a proper context? The psychologists 
who participated in this study represented a 
wide variety of interest and background. Sev- 
eral, while recognized as highly skilled clini- 
cians, had had little or no contact with military 
populations. The majority, however, had 
served during the past war as military psy- 
chologists, and a few were thoroughly familiar 
with the problem, having either gone through 
a flight-training program themselves or worked 
as psychologists on similar problems for the 
Air Force. In addition, to ensure some famili- 
arity with the problem, an extensive discussion 
of the training situation and the types of stress 
present, together with several representative 
case reports of cadets who failed to adjust, was 
forwarded to each participant. The uniformly 
negative results strongly suggest that lack of 
familiarity with the training situation and the 
types of adjustment which characterized the 
criterion groups is not a deciding factor. 

4. Did the judges have sufficient previous ex- 
perience with the types of test materials to inter- 
pret them properly? Inexperience with the tests 
used in the Randolph Field battery may have 
contributed appreciably to the lack of positive 
findings. To make feasible their administration 
to large groups, each of the six tests had been 
adapted from more familiar individual testing 
procedures. The Ink-Blot Test was a mixture of 
the commonly accepted individual and group 
Rorschach tests with one or two minor in- 
novations. The L-D Test was a group form of 
the Szondi with only one trial rather than 
several spread over different days as is stand- 
ard practice when individually administered 
in a Clinical setting. Though similar in struc- 
ture to other such tests, the SAM Sentence 
Completion Test (What Is He Saying?) was 
devised especially for use with student pilots. 
A large number of the test items, however, 
were identical to those used in earlier, well- 
known tests of this type. Except for the lack 
of an “inquiry,” the group form of the Draw- 
A-Person test was essentially similar to the 
standard individual procedure. Similarly, the 
general type of response given in Feeling and 
Doing and in Background Information was 
familiar to most clinicians, although the con- 
text in which the responses were made was 
different from the usual clinical setting. 

Several participating psychologists have ex- 
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pressed the belief that these modifications of 
standard procedures in order to make group 
administration feasible may have changed the 
meaning of test responses as commonly inter- 
preted in clinical practice. Others have argued 
that the basic response patterns should remain 
relatively constant, although less information 
may be available in the group tests. Whatever 
the case may be, it is worth noting that many 
of the psychologists felt they were making 
meaningful interpretations of the data. 

5. Is it possible that the personality evaluations 
were largely correct, but the criterion predictions 


failed due io inadequate integration of the various 


personality characteristics observed? The judg- 
ment process consists of at least two phases: 
(a) the description of the individual’s person- 
ality, and (6) the weighing of assets and lia- 
bilities noted in the individual and the pre- 
diction of adjustment to the training program. 
Since none of the predictions exceeded chance 
expectations, it is impossible to tell from the 
predictions alone whether the personality de- 
scriptions are invalid or whether they have 
been used inappropriately in arriving at pre- 
dictions. In all likelihood both phases of judg- 
ing have suffered from faulty inferences. Be- 
cause at least three different psychologists 
evaluated each cadet, the variation in per- 
sonality descriptions from one clinician to the 
next in the kind and degree of personality 
descriptions written makes a thorough, sys- 
tematic analysis of this type impossible. Even 
when only those cases on which there is 
unanimous agreement as to predicted outcome 
are considered, there is little congruence of 
terminology, each clinician favoring concepts 
related to his own frame of reference. A major 
problem in further analysis of data of this 
kind is finding an adequate system for classify- 
ing descriptive terms to make comparisons 
among clinicians. 

In spite of such variability, there are under- 
lying consistencies among clinicians which 
cannot be ascribed to chance alone. The a 
priori probability that three judges will unani- 
mously agree in their predictions for a 
single cadet is .25. For the global approach the 
amount of agreement among the psychologists 
was significantly better than chance. There 
was a low but statistically significant agree- 
ment in predictions based upon Background 
Information, Feeling and Doing, and the 
Drawing Test, when considered individually. 











490 


Less reliability was present in predictions 
based upon the Ink-Blot Test and the SAM 
Sentence Completion Test. No agreement 
among clinicians was observed for the L-D 
Test. Although none of the psychologists was 
able to predict beyond chance expectation the 
actual adjustment of the cadets, they did tend 
to use the test battery as a whole, and to some 
extent the individual tests, with some degree 
of consistency. 

To see what cues formed the basis for judg- 
ments, the evaluations of cadets upon whom 
there was unanimous agreement were ex- 
amined. A rating of failure seemea to depend 
frequently upon the presence of signs of low 
mctivation for flying, immaturity, strong 
dependency needs, anxiety, and sexual conflict. 
Ratings of success were less consistent and 
often based upon conflicting cues. In general, 
too few clinicians listed cues in sufficient num- 
bers to permit any systematic analysis. Further 
development and standardization of terms for 
the description of personality must take place 
before any appreciable understanding of the 
process of judging will be possible. 

In view of the results of the present study, 
several questions immediately arise. What are 
the implications of these findings with respect 
to the problem of predicting individual adjust- 
ment to other life stress situations? Can these 
results be accepted as reliable evidence of the 
general invalidity of clinical assessment pro- 
cedures as presently constituted for coping 
with prediction problems of this nature? How 
do the results of this study fit in with the gen- 
eral picture as it is gradually unfolding in 
somewhat similar studies such as the Michigan 
VA clinical trainee assessment program? 

In the opinion of the writers, the improve- 
ment of clinical judgments based on psycho- 
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logical tests and other clinical evidence will 
depend upon the completion of adequate quan- 
titative, controlled empirical research relating 
standardized measurement instruments to spe- 
cific criterion situations. Such situations in- 
clude diagnostic groups, job criterion groups, 
and various other life situations. In this re- 
spect, the present results are closely related to 
those of the Michigan program. 
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THE INFLUENCE OF SET ON SEMANTIC GENERALIZATION! 


NATHANIEL H. EISEN 


New York University* 


EMANTIC generalization refers to the gen- 
eralization to other semantically related 
words of a response originally condi- 

tioned to a single word. This is a form of 
learned, mediated, or secondary generalization, 
a concept which has become increasingly im- 
portant to those who attempt to describe com- 
plex human behavior in terms derived from 
Hull’s learning theory. For example, Cofer 
and Foley (1) have used secondary generali- 
zation to explain such phenomena as “mean- 
ingfulness,” set, memory, reasoning, intelli- 
gence, and “unconscious anxiety.” Dollard 
and Miller (3) place considerable emphasis on 
the concept in their discussion of the devel- 
opment of personality and the process of psy- 
chotherapy. They attribute the transfer of 
anxiety to new situations and the development 
of neurotic symptoms in part to secondary 
generalization; similarly, the development of 
the transference relationship during therapy 
is due in part to “‘response-mediated . . . mech- 
anisms of generalization.”” They suggest that 
a major function of the therapist is to help the 
patient in identifying his generalized responses 
and in “... attaching the same label or other 
cue-producing responses to different situations 
[which] should increase the amount of second- 
ary stimulus generalization and [in this way] 
help the patient to respond adaptively to cul- 
turally emphasized similarities” (3, p. 323). 


1 This article is based on a dissertation submitted in 
partial fulfillment of the requirements for the degree of 
Doctor of Philosophy at New York University, Febru- 
ary, 1952. The writer wishes to express his indebtedness 
to Professor Leland W. Crafts for his guidance and 
assistance throughout the course of this study. The 
writer also wishes to thank Dr. Bernard Locke, of the 
VA Regional Office, Brooklyn, New York, who pro- 
vided the psychogalvanometer and the polygraph; Drs. 
Jacob Cohen and Lawrence Karlin, for advice and 
assistance with regard to statistical problems; and the 
many members of the teaching staffs, of the Washing- 
ton Square College and the School of Education of 
New York University, through whose efforts subjects 
were obtained. Above all, the writer wishes to express 
his gratitude to Benjamin S. Brody, who designed and 
built the shock apparatus and the various electrodes, 
who served as assistant during the collecting of the 
data, and who in many ways made this experiment 
possible. 

2 Now at the University of Illinois. 
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The experimental evidence for semantic 
generalization has been reviewed most recently 
by Razran (14). A few of the previously re- 
ported studies are directly related to the ex- 
periment reported here. Razran (11) found 
that the salivary response conditioned to one 
word showed greater generalization to words 
of similar meaning (synonyms) than to words 
of the same sound (homonyms or homo- 
phones). Riess (15), using the GSR as his CR, 
and a buzzer as his CS, found that adult Ss 
showed generalization to antonyms interme- 
diate between that to synonyms and homo- 
nyms, thus obtaining a gradient of generali- 
zation from synonyms to antonyms to 
homonyms. Wylie (17), also using the GSR, 
but shock as her US, obtained maximum gen- 
eralization to homonyms, with greater gener- 
alization to synonyms than to control words. 
Cofer and Foley (1) explain the discrepancy 
between Wylie’s findings and those of Riess 
and Razran on the basis of the “intentions,” 
“expectations,” and “beliefs” held by Wylie’s 
Ss as to the purpose of her experiment. 

The related concepts of set, attitude, ex- 
pectancy, and the like have frequently been 
invoked either explicitly or implicitly to ex- 
plain experimental findings in this area in a 
posteriori fashion (e.g., 1, 2, 15, 17). However, 
few studies have reported the effects of ex- 
perimental variations in Ss’ set on semantic 
generalization. Razran (13) found that (a) 
manipulation of “mental sets,” induced by 
practice with lists of controlled associations, 
could increase or decrease the amount of gen- 
eralization to words in such semantic relation- 
ships to the CS as subordinate, supraordinate, 
and the like, according to the kind of practice 
involved; and (6) Ss with different political 
views differed in the directions to which they 
showed generalization to sentences reflecting 
varying political philosophies. 

HYPOTHESES 

Major hypothesis. The amount of generali- 
zation to a word having a given semantic re- 
lationship to the CS should be greater when a 
set has been induced for generalization to that 
relationship than it is to a word in the same 
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relationship when a set has been induced for 
generalization to other relationships. 

This hypothesis may be deduced from the 
Cofer and Foley (1) elaboration of Hullian 
theory, and it is also consistent with Razran’s 
theoretical position (14). It does not seem to 
have been stated and tested explicitly, but is 
implicit in some of the work of Razran (13) 
and others (e.g., 2). An explicit statement of a 
related hypothesis is made by Dollard and 
Miller, although in somewhat different lan- 
guage (3, p. 323). 

Minor hypothesis I. Generalization as de- 
scribed above should be significantly greater 
when an “implicit set” is induced toward a 
given semantic relationship by having S prac- 
tice in an apparently unrelated task than 
it is when a set is induced by means of ex- 
plicit instructions directed toward the same 
relationship. 

Dollard and Miller (3), in their discussion 
of therapy (pp. 281 ff.), mention three ways in 
which people may be induced to attach 
“labels” or other “‘cue-producing responses” to 
previously unlabeled situations. These are 
roughly analogous to three experimental 
methods for inducing a set: (a2) The therapist 
may so arrange matters that the individual 
spontaneously finds the proper labels himself, 
which is similar to the condition existing in the 
experiments by Razran (11) and Riess (15) in 
which Ss were permitted to discover the rela- 
tionships of the generalization stimuli to the 
CS with no help from E. (6) He may strengthen 
some of the patient’s random responses by 
means of “selective repetition” having an 
“automatic reward effect,” which is analogous 
to the “implicit set” in the experiment re- 
ported here. (c) He may take an active role in 
teaching the patient by explicitly pointing out 
relationships; this is analogous to the explicit 
instructions used here for inducing a set. Ac- 
cording to Dollard and Miller, the first of these 
methods should be the most effective, because 
it offers the greatest amount of satisfaction or 
reinforcement to the patient or S, whereas the 
last method should be the least effective. 

Razran (13) has studied the effects on seman- 
tic generalization of a set induced by means of 
practice with lists of controlled associations, 
but we find no studies in the literature on 
generalization comparing the effects of differ- 
ent methods of inducing sets. 

Minor hypothesis 11. Generalization as de- 
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scribed above should be greater to words of 
high frequency value on the Thorndike-Lorge 
list (16) than to words of low frequency value. 

This hypothesis is derivable from Hullian 
theory, and was indirectly tested by Cofer, 
Foley, and their associates (e.g., 5) in experi- 
ments involving the recall of lists of words. It 
is also consistent with Razran’s work (e.g., 10). 
However, according to Lashley and Wade, 
“.. the practice required in learning [is] a 
wholly unreliable measure of the learning 
process .. . So long as the effective attribute is 
present, the reaction [i.e., generalization] is 
elicited as an all or none function of that attri- 
bute” (9, pp. 74, 81), presumably regardless of 
the frequency value of the stimuli. 


METHOD 


Subjects. This experiment required 64 Ss to meet 
two criteria for the establishment of a conditioned 
GSR. In order to obtain this number of Ss, 95 individ- 
uals were seen initially. These were students at various 
schools of New York University during the spring and 
summer sessions of 1951. About 50 per cent were volun- 
teers; the remaining were recruited from courses in 
general psychology in which service as the S of an 
experiment was required. The Ss were assigned at ran- 
dom to the various experimental conditions. 

Of the 64 Ss finally selected, 40 were males; ages 
ranged from 17 to 31, with a median age of 23. All had 
learned English as their mother tongue, or concurrent 
with it. Questioning at the end revealed that all re- 
mained naive as to the purpose of the experiment. 

Materials. The conditioned and the generalization 
stimuli are shown in Table 1. Associated with each CS 
is a set of four generalization stimuli of high frequency, 
consisting of a word in each of four semantic relation- 
ships (homonyms, synonyms, antonyms, and “neutral” 
or unrelated control words), and a similar set of low 
frequency words. 

For the preconditioning trial, the words to be used 
later as CS and as generalization stimuli for a given S 
were interspersed among ten neutral words selected at 
random. The order of presentation of this list of words 
during the preconditioning trial was the same for all Ss. 

For the conditioning procedure, 20 high frequency 
words were selected, each of these having a synonym, a 
homonym, and an antonym. This group of words, with 
repetitions of the CS interspersed armong them, could 
be arranged alternately into three different sequences, 
thus providing three different “problems” for Ss to 
“solve” during the conditioning procedure (see below). 

For the experimental treatment in which Ss were 
given practice in an apparently unrelated task, E de- 
vised a P (for Practice) test, consisting of 28 words, each 
with its synonym, antonym, homonym, and associated 
neutral word arranged in columns in random order. 

Apparatus. The psychogalvanometer was manu- 
factured by the Lafayette Instrument Company. It 
contained a microampere galvanometer with a rated 
accuracy within 2 per cent. A vernier dial provided 
scale units of 10 ohms, and a variable shunt permitted 





SET AND SEMANTIC GENERALIZATION 493 


TABLE 1 
CONDITIONED AND GENERALIZATION STIMULI AND THEIR “G” Count FREQUENCY VALUE* 


cs Homonyms Synonyms Antonyms Neutral 

right AA write AA correct A left AA prepare AA 
rite 10 precise 6 erroneous 4 tuition 2 

buy AA by AA purchase A sell AA country AA 
bye 1 procure 22 dispense 9 robust 4 

raise A.A rays A lift AA lower AA promise AA 
raze 2 heighten 5 degrade 8 urban 4 

new AA knew AA modern AA old AA glass AA 
gnu — unfamiliar 4 obsolete 3 puppet 6 





* See reference 16. 


the measurable control of the amplification of the GSR _ necessitated rotation of these factors within each group 
changes. The range was 0 to 10M ohms. Connected by means of a latin square. The rotation scheme re- 
with the psychogalvanometer was a Lafayette Multi- quired a minimum of 4 Ss for each of the 16 groups, or 
channel Polygraph, model 603, ink recording, with a 64 Ss in all, with no replication. 

manual time marker. The GSR electrodes consisted of Procedure. During the brief preliminary period, S’s 
14-inch-square zinc plates fitted into 34-inch-diameter introduction to the GSR apparatus and to shock was 
rubber cups containing soft sponge rubber impregnated accompanied by appropriate but misleading remarks, 
with electrode paste. Both electrode contacts were on and his “base resistance level” and response to shock 
the palm of the same hand. The electrode paste con- were noted. Following this was a preconditioning trial 
sisted of powdered Bentonite, saturated CaCl, solution, exposure to the significant stimuli, which were inter- 
and glycerine. It was slow drying, showing no change spersed among a group of neutral words. Here and 


in conductivity in periods of an hour or more. throughout, each word was projected for 3 seconds, 
The stimulus words were typewritten on a thin witha 7-second interstimulus interval. 
Cellophane film placed between two sheets of red In line with suggestions made by Razran (e.g., 12), 


waxed paper (sold commercially as ‘“Radio-Mat the instructions given S during the conditioning pro- 
Slides”). Typing was done on a Vari-Type machine in cedure were designed to motivate him to attend to the 
lower case “‘Copper-Plate Gothic,” which provided _ stimuli, but to mislead him as to the purpose of the 
simple block letters of uniform height. An attempt was experiment. The CS were presented embedded in word 
made to control for the length of the words by varying sequences, and S was given the task of finding the 
the spacing between the letters. The Cellophane sheets _ basis for their arrangement. For example, in Sequence I 
were then secured in 35-millimeter Esco Slide Binders. the CS appeared between words beginning with the 
These slides were projected by means of a TDC Pro- same letter (“...dear, right, die, close, wait, right, 
jector, model D, 300 watts, fitted with a TDC Se- we...”). Some of the presentations of the CS were 
lectron Automatic Slide Projector and tray capable of _ reinforced with the US, while A kept E informed as to 
carrying up to 30 slides. A medium gray screen was whether S’s GSR approached the first criterion for 
used, 514 feet in front of S, with the words appearing conditioning (see below). When necessary, Sequences 
about 2!4 inches high in an 11-inch X 12-inch field. II and III were introduced, and cues for solving the 
Shock electrodes were 34-inch-square zinc plates at- sequences were given S, the aim being to prolong the 
tached to an adjustable elastic arm band; these were _ presentation of the words until conditioning was estab- 
about 1 inch apart. Contact was reinforced by means lished, and to terminate this part of the procedure as 
of the electrode paste. The shock apparatus drew cur- rapidly thereafter as possible. Later questioning re- 
rent from a 120-volt, 60-cycle line, which, by means of _ vealed that this goal was achieved. 
2 transformer, was changed to 480 volts at a maximum Reinforcing shock, when administered, was of 
of 40 milliamperes. A resistor and a potentiometer in _1-second duration, and the last one of the three seconds 
the circuit permitted variations in the current delivered in which the CS appeared. The first 4 or 5 CS presenta- 
to S, with a maximum of 4 milliamperes. tions were always reinforced, but thereafter reinforce- 
The S sat at a table, with E, who operated the pro- ment was partial and random, depending upon the 
jector, at his right. An assistant (A) operated the degree to which S approached the criterion. The first 
psychogalvanometer and the polygraph. During the criterion of conditioning was met when (a) a point was 
conditioning procedure, when the polygraph recorder reached at which two presentations of the unreinforced 
was not in use, A administered the shock and was able CS yielded measurable GSR’s, and the preceding and 
to signal E as to S’s responses without the latter’s following neutral words showed no response, or (b) 
knowledge with a few Ss for whom the neutral words never failed 
Experimental design. Simultaneous manipulation of _ to elicit a GSR, two presentations of the unreinforced 
three variables deriving from the three hypotheses CS gave GSR’s at least twice as strong as that of re- 
called for a 4 X 2 X 2 factorial design, or 16 groups of | sponses to adjacent neutral words. 
Ss. Control of the effects of specific conditioning and Half of the Ss were given the P test as a task os- 
generalization stimuli, and of the effects of varying  tensibly unrelated either to the conditioning procedure 
orders of presentation of the generalization stimuli, or to the test of generalization to follow. Instructions 
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were designed to induce a set for synonyms, antonyms, 
homonyms, or neutral words. The other half of the Ss 
worked at a digit-symbol substitution test for 90 
seconds, the average time taken by Ss to complete 
the P test. 

The Ss who had taken the P test were tested for 
generalization, with instructions merely to look at the 
stimuli and to wait for questions about them. The re- 
maining Ss (those who had not taken the P test) were 
given explicit instructions to look for words in one of 
the significant semantic relationships of the CS during 
the exposure of the g»: cralization series, but to wait 
for questions before id: ~.tifying the sought-for word. 

After these instructions were given, an additional 
reinforced CS was presented, followed by the generaliza- 
tion stimuli in prearranged order and, finally, a repeti- 
tion of the CS without reinforcement. The second 
criterion for conditioning was met when S’s GSR to 
the final unreinforced CS was larger than that to at 
least one of the four immediately preceding generaliza- 
tion stimuli. 

The Ss were questioned as to their identification of 
the generalization stimuli and their expectations of re- 
inforcement. The purpose of the experiment was ex- 
plained, Ss’ questions were answered, and the need for 
secrecy was emphasized. The procedure varied in 
duration from 45 to 65 minutes. 


RESULTS 


The GSR was recorded as a rising curve on 
polygraph paper. The height of each curve was 
measured in %oo-inch units and was then 
transformed mathematically into Logarithm 
Conductance Change units, as recommended 
by Haggard (6) and others. It was possible 
(4) to treat the postconditioning GSR’s as a 
function of the conditioning procedure alone, 
ignoring the preconditioning-trial. responses. 

1. Prior to conditioning, there was no sig- 
nificant difference between GSR’s to the 
stimuli to be used as CS and the mean GSR’s 
to the four words to be used as generalization 
stimuli. Following conditioning, however, the 
GSR’s to the final unreinforced CS were sig- 
nificantly greater (¢ = 6.62, p < .01) than the 
mean of the GSR’s to the four generalization 
stimuli, from which we may infer that con- 
ditioning was successfully established. 

2. The effects of the attempts to control 
three variables (CS words used, generalization 
stimuli used according to their semantic rela- 
tionships, and the position of the generaliza- 
tion stimuli in their orders of presentation) 
were investigated by means of three analyses 
of variance.’ The only significant F value was 
that deriving from the interaction of the words 


*The supplementary tables have been deposited 
with the ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washington 
25, D. C. Order document number 4215 remitting in 
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POSITION IN GENERALIZATION SERIES 


Fic. 1. MAGNITUDE OF THE GENERALIZATION GSR as 
A FUNCTION OF THE POSITION OF THE GENERALIZA- 
TION STIMULUS IN THE GENERALIZATION SERIES 


used as generalization stimuli, according to 
their semantic relationships, and the position 
of these words in their orders of presentation 
(see Fig. 1). The magnitude of the response to 
synonyms decreased as a function of the posi- 
tion of the synonym in the order of presenta- 
tion; that is, the later the synonym appeared 
in the series of generalization stimuli, the 
smaller the response to it was apt to be. The 
other semantic relationships showed no clear- 
cut trends, and an analysis of the synonym 
trend indicated that it was not significant. The 
absence of significance, however, was due to 
the large within-positions value used as the 
error term. It should be noted that each of the 
16 entries of Figure 1 is the mean of 16 meas- 
ures, each measure having been obtained from 
one S in each of the 16 experimental groups. A 
more significant synonym trend might be 
found if within-position variance were reduced 
by decreasing the multiplicity of experimental 
treatments. 

3. An investigation of the relationship be- 
tween variations in experimental procedure 
and S’s general response level to stimuli re- 
vealed that the groups subjected to different 





advance $1.25 for photoprints or $1.25 for 35 mm. 
microfilm. Make checks payable to Chief, Photo- 
duplication Service, Library of Congress. (See Tables 
1, 2; 3, 4; 5, 6.) 
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treatments were not homogeneous with respect 
to response level.‘ Whether this difference is 
attributable to the effects of the experimental 
treatments themselves or to some bias in the 
selection of Ss was not investigated. 

4. In the process of testing the various pre- 
dictions arising from the hypotheses, 36 / tests 
were made. (Four were necessary to test the 
major hypothesis, 8 each to test the two minor 
hypotheses, and 16 to test the major hypothesis 
within each of the 16 experimental groups.°*) 
The number of significant, or almost signifi- 
cant, ?’s was four, just about the number one 
might expect by chance alone. We therefore 
cannot reject the null hypothesis that “‘set” as 
varied in this experiment did not affect seman- 
tic generalization. Alternative statistical treat- 
ments, involving nonparametric statistical 
techniques, yielded similar negative results. 


DIscUSSION 

We may eliminate the following factors (4) 
as possible bases for the failure to obtain posi- 
tive findings, either in the predicted direction 
or in the form of a generalization gradient: (a) 
lack of sensitivity of the apparatus to intra- 
individual differences, (6) lack of acquaintance 
of Ss with the stimuli, (c) inadequate condi- 
tioning, and (d) inadequate statistical treat- 
ment of the data. Two factors, however, may 
explain the obtained results. These are: (a) the 
pattern of reinforcement utilized, and (6) the 
degree of discrimination training achieved dur- 
ing conditioning. 

1. It might be argued that owing to differ- 
ences in the pattern of reinforcements, Riess’s 
(15) Ss expected shock when exposed to the 
generalization words, whereas in this experi- 
ment the Ss did not expect shock at this time. 
Such an explanation seems to call upon some 
form of “expectancy theory,” but is probably 
translatable into S-R terminology (8). 

2. The conditioning procedure was such as to 
maximize discrimination between the CS and 
other words. If we assume that complete dis- 
crimination did not occur in the other studies 
(there is no evidence to contradict this assump- 
tion), then gradients of generalization are at- 
tributable to failure to observe differences 
among the CS and generalization stimuli. This 


seems to be consistent with the Lashley-Wade 
‘Refer to footnote 3 regarding supplementary 
tables; see Table 7. 
5Refer to footnote 3 regarding supplementary 
tables; see Tables 8, 9, 10. 
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(9) interpretation of generalization. However, 
the Lashley-Wade position is too inconsistent 
to serve as a basis for a thorough explanation 
of generalization. From their discussion it can 
be argued® that the very process of “‘restriction 
of associations,” which is supposed to bring 
about generalization, tends to increase dis- 
crimination and so tends to decrease the “‘fail- 
ure to note distinguishing characteristics” of 
the stimuli, which is also supposed to increase 
generalization. 

Razran’s theory does not account for the re- 
sults obtained here either. Razran (12, 14) 
attributes ‘“‘true generalization” to automa- 
tized categorizing, or rating habits, of S, habits 
which increase in strength with “maturity, 
alertness . . . and other capacity-enhancing in- 
fluences,” and which are correlated, presum- 
ably, with the ability to discriminate among 
stimuli. Although the Ss here were able, for the 
most part, to identify the generalization stim- 
uli as related to—and therefore were able to 
categorize them as similar or dissimilar to—the 
CS, neither the hypothesized nor any other 
kind of generalization gradient appeared. Raz- 
ran introduces the concept of “‘pseudo-generali- 
zation”’ to account for instances of generaliza- 
tion due to lack of discrimination; but to 
accept this as an explanatory concept would 
require the additional assumption that the 
failure by Ss of other studies to discriminate 
among stimuli (and thereby producing a gra- 
dient) was the result of fatigue or lack of at- 
tention (14). 

Hullian theory (7) takes into account the 
effects of discrimination training on primary 
generalization. Although the effects of such 
training on secondary generalization are men- 
tioned by Dollard and Miller (3), the mecha- 
nism by which such training operates is not 
made clear. According to Hull, the degree to 
which differential reinforcement ‘“steepens” 
the primary generalization gradient is an in- 
verse function of the proximity of the con- 
trasted stimulus to the CS along the stimulus 
dimension. By analogy, we should expect 
minimum effects on the secondary generaliza- 
tion gradient when the contrasted stimulus is 
an unrelated word (and maximum effects when 
the contrasted stimulus is a synonym). As an 
alternative deduction from Hullian theory, it 
may be stated that secondary generalization 


6 The writer is indebted to Dr. Lawrence Karlin for 
suggesting this criticism of the Lashley-Wace position. 
g : I 
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may occur along separate dimensions for each 
of the semantic relationships—a separate gra- 
dient for synonyms, antonyms, etc. In that 
case discrimination training with antonyms 
should have no effect on generalization to 
synonyms, and discrimination training with 
neutral! words should have no effect on gen- 
eralization to other relationships. The con- 
clusion based on either of these two lines of 
reasoning is the same. Hull’s theory does not 
predict that training with neutral words will 
result in complete obliteration of the generali- 
zation gradient to related words. 


CONCLUSIONS 


In their preface, Dollard and Miller (3) state 
that most of their ideas are “hypotheses, the 
basis for research, not proven principles. Fur- 
thermore, it has not been possible to formulate 
these hypotheses in as rigorous, systematic, 
and quantitative a form as will eventually be 
desirable.” The results of this study are in 
agreement with this statement. Although it has 
been possible to derive hypotheses regarding 
semantic generalization from the work of 
Dollard and Miller and others who have been 
influenced by Hull’s thinking, existing theories 
of secondary generalization cannot account for 
the results obtained here. 

The present findings suggest several new 
problems and areas of research. Among these 
are: (a) the effects of different degrees of dis- 
crimination training on the secondary generali- 
zation gradient; (6) the effects of discrimina- 
tion training with stimuli in different semantic 
relationships of the CS on the gradient of 
secondary generalization; (c) the effects of 
different patterns of partial reinforcement on 
secondary (and perhaps on primary) generali- 
zation; and (d) the effects on generalization to 
the synonym—or any other semantic rela- 
tionship—of the position in the generalization 
series at which the crucial word appears. All of 
these problems are relevant to the use of the 
concept of semantic, secondary, mediated, or 
learned generalization as an explanation of 
what occurs in therapy or in the learning ex- 
perience of the child. 


SUMMARY 


The purpose of this study was to compare 
the influence on semantic generalization of 
variations in (a) the kinds of set given S, (5) 
the methods of inducing a set, and (c) the past 
experience of S with the generalization stimuli. 
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The findings were essentially negative, gen- 
eralization occurring randomly and without 
regard to the relationship of the stimulus to the 
CS, or of the set given S. That is, neither the 
hypothesized results nor those reported in 
the literature appeared. Current theories of 
secondary generalization are inadequate to ex- 
plain these negative findings, but several new 
areas of research are suggested. 
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CONDITIONING LEVEL IN THE BEHAVIOR DISORDERS'! 


JANET A. TAYLOR 


Northwestern University 


N MODERN psychology the clinical concept 
of anxiety can be considered to be the 
dominant and key explanatory principle 

of the behavior disorders. Freud, who is given 
major credit for originating these theories, 
stated in 1921 that anxiety was the “funda- 
mental phenomenon and the central problem 
of the neuroses” (5, p. 85). Although many 
have disagreed with Freud’s notions as to its 
source and etiology, anxiety as an explanatory 
concept has been widely accepted (e.g., 2, 3, 
7, 19), the current tendency being to regard 
the behavioral reactions of the neuroses either 
as direct expressions of anxiety or as a defense 
against this anxiety. The symptoms found in 
the functional psychoses have also been simi- 
larly analyzed in terms of their relationship to 
anxiety, although usually not in the detail 
accorded the neurotic disorders. 

The development of the anxiety concept 
rests primarily on observational evidence 
gathered from clinical data. While many of the 
postulated relationships of anxiety to overt 
neurotic and psychotic behavior appear to have 
a firm empirical basis, the concept has never 
been systematically analyzed nor have ade- 
quate experimental tests been made concern- 
ing many of the speculations about it. It is the 
purpose of this investigation to attempt to 
verify experimentally some of the assumptions 
that have been made. 


USAGES OF THE ANXIETY CONCEPT 


As the writers have pointed out in previous 
studies (13, 14) the term “‘anxiety” appears to 
be employed by the clinician in at least two 
different ways: (a) as a term applied to a par- 
ticular set of overt behavior patterns, hereafter 
referred to as “manifest anxiety,” and (5) a 
hypothetical construct, hereafter called “in- 


1 This investigation was supported by a research 
grant from the National Institute of Mental Health, of 
the National Institutes of Health, Public Health 
Service. The writers wish to acknowledge the very 
helpful cooperation of Dr. David Shakow, Chief of the 
Department of Psychology, Illinois Neuropsychiatric 
Institute, and of Dr. Samuel Liebman, Superintendent 
of the North Shore Health Resort. Acknowledgement is 
also made to George Faibish and John McAllister, who 
served as research assistants for the project. 
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ternal anxiety state,”’ that is assumed to be a 
determinant of overt behavior. 

Manifest anxiety. The definition of the con- 
cept “manifest anxiety” appears to have been 
fairly clear and unambiguous. A group of overt 
reactions, directly observable or measurable, 
are identified as manifest anxiety. This be- 
havior consists of such verbal responses as re- 
ports of chronic worry and tension, and such 
nonverbal patterns as excessive activity of 
autonomically controlled effectors, disturb- 
ances in appetite and elimination, sleepless- 
ness, panic attacks, etc. These overt behavior 
patterns are also assumed to be the accom- 
paniment, or to represent the overt expression 
of, an internal emotional state of the person. 
While attempts have been made to obtain 
physiological indices of this state, the emo- 
tionality has usually been merely assumed by 
the clinician to be present, accompanying the 
more overt, directly observable symptoms 
mentioned above. 

Internal anxiety state. In contrast to this 
concept of manifest anxiety, which is identi- 
tied with specific observable behavior symp- 
toms, the concept of internal anxiety is a 
hypothetical construct that has been intro- 
duced in connection with attempts to account 
for certain other types of behavior patterns 
that occur in the neuroses. These behavior 
patterns have typically been referred to as 
defense reactions, which supposedly protect 
the individual against the internal state of 
anxiety. According to this conception the 
individual learns to respond to emotion-arous- 
ing stimulus conditions, not with overt emo- 
tional symptoms (manifest anxiety) but in- 
stead, with certain instrumental responses. It 
is assumed, further, that these newly learned 
responses serve in some manner to avoid the 
arousal of the internal anxiety state or to re- 
duce its intensity once it has been evoked. 
Thus the presence of these defense mechanisms 
is said to result in the minimization of the 
amount of functional internal anxiety. Since 
the internal anxiety state is merely inferred 
from these defense reactions, and is not ob- 
served or measured, this second usage of 
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anxiety can be seen to be a hypothetical 
concept. 

Although on a functional level the internal 
anxiety state is held in check by the presence 
of the defenses, it should be noted that the 
individuals exhibiting such symptoms are still 
considered to be potentially anxious. That is, 
if the defenses are somehow prevented from 
occurring, the stimulus conditions previously 
giving rise to the defense reactions would now 
arouse the internal anxiety. Furthermore, this 
anxiety would produce the type of overt be- 
havior that is described as manifest anxiety. 
This type of interpretation would seem to 
identify the internal emotional state assumed 
to be operating at the time of manifest anxiety 
with the internal anxiety state postulated to 
lie behind the appearance of defense reactions. 

In the typical textbook treatment, the rela- 
tionship between internal anxiety and overt 
symptomatology is described for each behavior 
disorder separately. It can be seen from these 
discussions that the assumption has not been 
made by clinical theorists that defenses against 
internal anxiety operate in uniform fashion 
and with equal effectiveness in all diagnostic 
groups, or that the presence of defense reac- 
tions precludes the appearance of some mani- 
fest anxiety responses. In obsessive-compul- 
sive disorders, for example, manifest anxiety is 
frequently observed, disappearing only when 
the obsessive thought and/or compulsive act 
have occurred. In the classical hysterical syn- 
drome, on the other hand, manifest anxiety is 
almost completely absent (“la belle indiffé- 
rence”). Furthermore, the internal anxiety 
postulated to lie behind the symptoms is kept 
at a minimal level. Thus, the hysterical de- 
fense system may be considered to be par- 
ticularly effective in checking internal anxiety 
and the obsessive-compulsive syndrome, in 
comparison, far less so. 

Within the neurotic disorders the symptoms 
of only one group, anxiety neurotic (anxiety 
state), have not been assumed to have a de- 
fensive function. In fact, this group is often 
characterized as one in which defense reactions 
have failed to develop. That is, individuals are 
classified in this category on the basis of the 
presence of a large number of manifest anxiety 
symptoms and the absence of any well-inte- 
grated defense system. Thus, while the anxiety 
neurotic has a few scattered defense reactions, 


Janet A. TAYLOR AND KENNETH W. SPENCE 





he stands in distinct contrast to individuals in 
other diagnostic groups, being relatively un- 
protected against his anxieties. 

As we have seen, anxiety plays a key ex- 
planatory role in the neuroses. The overt symp- 
toms in these disorders, if not direct expressions 
of internal anxiety, are defense reactions 
against it. Thus, the major differences among 
subgroups are in the particular protective 
mechanism that is used to reduce the anxiety. 

Anxiety has been assigned a similar role in 
the functional psychoses: many symptoms in 
these disorders are assumed to have defensive 
properties with respect to anxiety. However, 
in contrast to the neuroses, the anxiety con- 
cept as an explanatory principle occupies a 
relatively minor position; anxiety and reactions 
to it are considered as being only one segment 
of a multifaceted phenomenon. Furthermore, 
the theories concerning the dynamics of these 
disorders are insufficiently detailed to permit 
direct comparison of neurotic and psychotic 
groups with respect to amount of internal 
anxiety. For example, clinical theory would 
permit us to hypothesize that the anxiety 
neurotic is more likely to be anxious at any 
moment of time than the obsessive or the 
hysteric, but would not provide a basis for 
similar judgments among psychotic groups or 
between psychotic and neurotic Ss. 


AN EXPERIMENTAL TEST 


A number of studies have appeared in the 
literature that have shown a relationship be- 
tween level of classical conditioning and mani- 
fest anxiety variously defined, in both normal 
Ss (1, 6, 11, 12, 13, 14) and psychiatric patients 
(9, 17, 18). The interpretation of these results 
offered by the writers (12, 14) has been that 
degree of manifest anxiety is related to the 
intensity of an internal anxiety (emotional) 
state and that this, in turn, reflects the level 
of drive (D) of a subject. Thus, according to 
an analysis in S-R terms, Ss exhibiting much 
manifest anxiety should, by virtue of higher D, 
exhibit a higher level of conditioning per- 
formance than those with lower manifest anx- 
iety. The assumption that manifest anxiety 
reflects drive level has also been supported by 
the results of a number of investigations in- 
volving learning tasks other than classical 
conditioning (4, 8, 16). 

Attention was directed to the fact that clini- 
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cians have tended to postulate that the internal 
anxiety that produces manifest anxiety symp- 
toms is the same state as that responsible for 
evoking defensive reactions. If we follow this 
assumption, we see that the established relation 
between manifest anxiety and conditioning 
provides a basis for testing the amount of 
functional internal anxiety that is present in 
patients who do not exhibit manifest anxiety. 
That is, differences among subgroups of pa- 
tients in level of conditioning could be inter- 
preted as reflecting differences in degree of 
functional internal anxiety. 

In an earlier study by the writers (13) in- 
volving comparisons between normal, neurotic, 
and psychotic Ss, conditioning scores were used 
in this manner. Clinical speculations led to the 
prediction that psychotic and neurotic Ss 
would, in general, exhibit a higher condition- 
ing level than normals due to the greater degree 
of manifest and internal anxiety in these in- 
dividuals. The results of this investigation 
indicated that the conditioning performance of 
the psychotic Ss was significantly higher than 
that of both normals and neurotics. The neu- 
rotic group was superior to the normal Ss but 
the difference was not statistically significant. 

The purpose of the present experiment was 
to test further implications of clinical theory. 
Specifically the prediction was made that Ss 
with the diagnostic label ‘anxiety state” 
should show higher conditioning scores than 
Ss in other neurotic groups in which defense 
behavior is said to predominate. 


METHOD 


Subjects. The Ss were psychiatric patients obtained 
from the inpatient and outpatient services of the Illinois 
Neuropsychiatric Institute and from the North Shore 
Health Resort. Patients were used as Ss in the experi- 
ment as they became available without any prior 
selection or screening as to diagnosis. Some screening 
was involved, however, in that the psychiatrists re- 
ferred only those patients whom they judged to be 
capable of cooperating in the experimental situation 
and who would not be excessively disturbed by the 
experience. 

Psychiatric diagnoses for the Ss were drawn from 
the individual files maintained by each institution. In 
addition to the diagnosis, other types of information 
were obtained for each patient. These included items 
of personal data (age, occupation, onset of illness, etc.) 
drawn from case histories and a psychiatric check list 
that was completed for each patient by the psychiatrist 
in charge of the case. The check list consisted of state- 
ments describing common behavior patterns found in 
the functional behavior disorders, including a detailed 
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TABLE 1 


DISTRIBUTION OF DIAGNOSES IN THE NEUROTIC AND 
Psycuotic GROUPS 


Neurotics 


Diagnosis 
Anxiety state 
Anxiety hysteria 
Obsessive-compulsive 
Depressive reaction 
Hysteria (misc.) 
Hypochondriasis 


Psychotics 
Diagnosis 


Schizophrenia 
Simple 
Paranoid 
Hebephrenic 
Mixed type 
Involutional psychosis 


Psychoneurosis mixed Miscellaneous 
Alcoholism 
Drug addiction 
Behavior & character 
disorders 
Total 


list of overt anxiety symptoms. The psychiatrist was 
asked to indicate absence or severity of each symptom 
if present, and further, to assign to each patient an 
over-all rating as to the degree of overt anxiety. In 
addition, an individual form of a manifest anxiety 
scale developed by one of the writers (15) was admin- 
istered to each patient, usually several days after the 
conditioning session. 

One hundred and forty-five patients participated in 
the conditioning situation. The records of 29 of these 
were excluded from consideration in the results of the 
investigation for the following reasons: 6 for equipment 
failure, 3 due to inability or unwillingness to cooperate 
in the conditioning situation, 2 because of unavail- 
ability of any diagnosis, and 18 because they did not 
meet the latency criteria set up to exclude voluntary 
responders. Of the 116 remaining Ss, 74 were diagnosed 
as neurotic and 42 as psychotic.? A more detailed break- 
down of the composition of the neurotic and psychotic 
groups by diagnostic category is shown in Table 1. 

Apparatus and method of recording. The equipment 
for recording the eyelid closure and presenting the CS 
and UCS has previously been reported (13) and will 
not be described here. Seated in a straight-backed chair 
in a semi-darkened room containing the conditioning 
apparatus, S was instructed to blink upon receiving a 
verbal ready signal and then to fixate on a 6-cm. circu- 
lar milk glass window or disc, placed at a distance 6 ft. 
in front of him. The brightness of the disc between 
trials was approximately 1 apparent foot-candle. The 
CS consisted in an increase in brightness of the disc 
to approximately 24 apparent foot-candles. The dura- 
tion of the CS on each trial was 520 msec. Following the 
onset of the CS by 470 msec., the UCS, a puff of air, 
was administered to the right eye. The air puff was 
produced by the fall of a 30-mm. column of mercury, 
magnetically released, in a manometer. Intertrial inter- 





2 Of this total, the data from 34 neurotic and 21 
psychotic Ss were presented in an earlier study (13) 
by the writers. Additional Ss were tested and the entire 
group utilized in the analyses reported in the present 
investigation. 
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vals of 15, 20, and 25 sec., averaging 20 sec. and fixed 
according to a prearranged schedule, were used. 

Studies in the Iowa laboratory have shown that 
some Ss attempt to avoid the air puff by voluntarily 
closing their eye to the CS, keeping it closed until after 
the occurrence of the UCS. In order to eliminate such 
voluntary responders, objective latency criteria utilized 
in previous studies (10, 13) were employed. Briefly, the 
records of all Ss who gave more than 50 per cent CR’s 
with latencies less than 300 msec. were excluded from 
the sample. 

Conditioning procedure. Each S first received three 
presentations of the light alone in order to check for 
any initial conditioned response tendencies to the 
light. A single puff of air, unaccompanied by the light, 
was then administered and spontaneous blinking re- 
corded for 40 sec. Following these preliminary trials, 
all Ss were given 80 paired presentations of the light 
and air puff. 


RESULTS AND DISCUSSION 


Comparison of neurotic groups. In comparing 
the conditioning scores of anxiety neurotics 
with those in other neurotic groups a question 
arose, as to whether or not certain diagnostic 
categories should be included in the analysis, 
namely: behavior disorders, alcoholism, drug 
addiction, and mixed psychoneurosis. The latter 
diagnosis is typically a ‘“‘wastebasket”’ category 
into which individuals failing to correspond to 
any of the other neurotic patterns are placed. 
Since many of these Ss may be expected to 
exhibit a number of overt anxiety symptoms 
in addition to other reactions, opposing this 
group to anxiety neurotics might not be con- 
sidered a fair test of the clinical theory. The 
other three groups (alcoholics, behavior dis- 
orders, and drug addicts) likewise raised a 
problem since the relationship of their behav- 
ior to anxiety has not been clearly specified. 
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Accordingly, two analyses were made between 
the conditioning scores of anxiety neurotics 
and those of other groups, one including all 
other neurotics and the other with the four 
questionable categories omitted. 

Curves of acquisition of the CR for anxiety 
neurotics and the other groups are shown in 
Fig. 1. Although the curve for anxiety neurotics 
tends to be above that for all other neurotics, 
as would be predicted by clinical theory, there 
was some overlapping. The mean conditioning 
scores for anxiety neurotics were also slightly 
higher than for all other neurotics, as is shown 
in Table 2. The difference between means, 
however, was not statistically significant. 

When the scores for the individuals in the 
four categories previously mentioned were ex- 
cluded from the nonanxiety category, the 
conditioning scores for this group were de- 
creased. Although these Ss (‘‘other neurotics’’) 
exhibited a lower conditioning performance on 
every block of trials than did anxiety neurotics, 
the ¢ ratio comparing the mean conditioning 
of this reduced group with anxiety neurotics 
was only 1.22, a value falling far short of the 
level needed to reject the null hypothesis. 

In attempting to explain the failure to find 
significant differences between the neurotic 
groups a check was made on the fundamental 
assumption that the presence of the defense 
reactions precludes, or at least reduces, the 
appearance of manifest anxiety symptoms. On 
a questionnaire filled out for each patient by 
his psychiatrist the patient was classed into 
one of three groups according to the over-all 
amount of manifest anxiety exhibited: absent 
or mild, medium, or severe. (All judgments 
were made in terms of a patient population.) 
A chi-square test of independence indicated 
that there was no significant difference between 
the number of anxiety neurotics and all other 
neurotics assigned to each of these categories. 
When the four questionable groups were re- 
moved, the anxiety ratings for the latter group 
decreased slightly but the value of chi square 
again failed to reach statistical significance. 
A comparison of means on the manifest anx- 
iety scale, described earlier, also failed to show 
differences, the mean of the anxiety neurotics 
being 30.00 and of all other neurotics 30.96. 
Since the groups were similar in these measures 
of overt anxiety, it is not surprising that their 
conditioning performance also did not differ 
significantly. 
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The results of the present investigation, 
then, do not conform to the predictions devel- 
oped from the clinical hypotheses concerning 
the significance of certain neurotic symptoms. 
Not only was there failure to support the 
inference that these symptoms reduce internal 
anxiety, as shown by the conditioning data, but 
also the analyses of the estimates of manifest 
anxiety indicated that the groups did not differ 
in this respect. In this group of patients at 
least defensive neurotic symptoms seemed to 
be merely added to manifest anxiety reactions 
and did not replace them. 

A number of qualifications could be added 
to the defense theory which would leave it 
essentially intact and still explain the present 
findings. Individual differences might be ex- 
pected in the degree to which a defense system 
was perfected. For example, patients whose 
symptoms have appeared only recently might 
still be in the process of developing them and 
hence be more anxious than persons who had a 
long history of illness; some individuals might 
be attempting to give up their neurotic symp- 
toms and others might be content with them. 
Failure to take such variables into account 
might obscure the relationship of well-devel- 
oped defenses to anxiety. Or it might be 
claimed that in terms of the number and effi- 
ciency of the defenses, neurotic disorders lie 
along a continuum in such a fashion that 
opposing one extreme group with all others, as 
was done in the present study, will reveal no 
differences in either manifest or internal anx- 
iety unless extremely sensitive measuring in- 
struments are usec. Whatever the explanation, 
a reasonable conclusion to be drawn from the 
results of the present study is that the clinical 
hypotheses describing the significance of neu- 
rotic behavior are, at best, oversimplified and 
need elaboration. Further research, testing 
such speculations as those offered above, is 
certainly indicated. 


TABLE 2 


MEAN PERCENTAGE OF CONDITIONED RESPONSES 
EXHIBITED BY NeEvROTIC GROUPS 
IN 80 CONDITIONING TRIALS 


Group N 





Anxiety neurotics 

All other neurotics 

Other neurotics minus 
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NUMBER OF CR’s FOR THE NEUROTIC AND PSYCHOTIC 
Groups IN SUCCESSIVE BLOcKS OF TEN TRIALS 


Psychotics vs. neurotics. The Ss in the present 
investigation include 34 neurotic and 21 psy- 
chotic individuals whose conditioning per- 
formance was reported by the writers in 
another study (13), plus an additional num- 
ber of Ss tested at a later time. In the original 
study the psychotic Ss, when compared to 
neurotics, exhibited a significantly superior 
conditioning performance. In order to deter- 
mine whether this difference would still be 
found with a larger sample from each popula- 
tion, conditioning scores for the groups were 
again compared. The conditioning curves 
shown in Fig. 2 indicate that the psychotic 
group continued to exhibit a consistently 
higher performance throughout the course of 
the conditioning trials. However, as may be 
seen from Table 3, the results of a ¢ test indi- 
cated that the difference between means for 
the total conditioning scores was significant 
only between the .05 and .10 level of confi- 
dence, somewhat lower than was found in the 
earlier study. 

The two groups were also compared with 
respect to the data from the psychiatric ques- 
tionnaire and the manifest anxiety scale. For 


TABLE 3 


MEAN PERCENTAGE OF CONDITIONED RESPONSES ON 
80 CONDITIONING TRIALS 


Group N 


Mean ou 


Neurotics 74 26.69 2.98 


1.82* 
35.89 


Psychotics 42 4.11 





* Significant between the .05 and .10 level of confidence for 
104 df, 
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the former, a chi-square test of independence 
indicated no difference in the proportion of 
neurotic and psychotic Ss in each of the three 
overt anxiety categories. A comparison of the 
means on the manifest anxiety scale also re- 
vealed no significant difference between groups. 
Thus, it would appear that if the superiority 
of the psychotic group in conditioning were to 
be interpreted in terms of anxiety, the differ- 
ences would be in the amount of functioning 
internal, rather than manifest, anxiety. Ac- 
cording to this interpretation, psychotic 
symptoms mask or partially prevent the ap- 
pearance of overt anxiety responses without a 
corresponding reduction in the amount of 
internal anxiety. 

A further possibility that should be men- 
tioned is that the greater conditioning per- 
formance of psychotics may be due to factors 
other than anxiety) While previous studies 
have demonstrated a relationship between 
conditioning and manifest anxiety, variously 
defined, the correlation coefficients that have 
been reported between these two variables 
(6, 13) indicate that a relatively small amount 
of the variability among Ss can be accounted 
for in terms of anxiety. Thus, it is logically 
possible that the superiority of psychotics over 
neurotics, and, as was found in an earlier 
study (13) over normals, is wholly or partially 
due to as yet unidentified factors that separate 
psychotic individuals from other groups. 


SUMMARY 


The present study was designed to investi- 
gate differences in conditioning performance 
between anxiety neurotics (anxiety state) and 
neurotics in other diagnostic categories, and 
between neurotics and psychotics. Seventy- 
four neurotic Ss (including 15 anxiety neu- 
rotics) and 42 psychotic Ss were run in an eye- 
lid conditioning situation under the same ex- 
perimental conditions. 

The conditioning performance of the anxiety 
neurotics and other neurotics did not differ 
significantly. Further, no difference between 
the two groups was found in the amount of 
overt anxiety exhibited, as measured by a test 
of anxiety and a psychiatric questionnaire 
filled out for each patient. The implications of 
these results for clinical hypotheses concerning 
the function of certain neurotic symptoms 
were discussed. 

When the psychotic Ss were compared to the 
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total neurotic group, their conditioning per- 
formance was found to be superior between 
the .05 and .10 level of confidence. Speculations 
concerning this difference were offered. 
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THE ROLE OF EXPECTANCY IN THE PERFORMANCE 
OF POSTHYPNOTIC BEHAVIOR! 


SEYMOUR FISHER 
Army Medical Service Graduate School 


ESPITE a wealth of anecdotal material 
and case reports, there have been few 
experimental investigations of the 

performance of posthypnotic behavior. Aside 
from an important contribution by Erickson 
and Erickson (3) and several studies dealing 
with the duration of posthypnotic suggestions 
summarized in Weitzenhoffer’s paper (8), the 
major variables remain essentially unexplored. 

The purpose of the present communication 
is to report the results and implications of a 
simple, but relatively well-controlled, experi- 
ment on posthypnotic suggestion. The primary 
variable under investigation was the nature of 
the setting in which the suggestion was carried 
out, and an attempt was made to manipulate 
the subject’s (S’s) expectancies after the ter- 
mination of the formal hypnotic state. Thus, 
the general hypothesis being tested is that 
successful performance of a posthypnotic act 
is not only a function of the specific sugges- 
tions given under hypnosis, but is also con- 
tingent upon the S’s subsequent expectations. 


METHOD 

Subjects 

A total of 13 Ss was employed in this study. All were 
capable of attaining either a medium or deep hypnotic 
state, and all had previously shown that they could re- 
spond to simple posthypnotic suggestions. Twelve Ss 
were males who had earlier participated in a separate 
hypn»tic experiment; the thirteenth S was a high school 
girl who had had only a few hours of hypnotic experi- 
ence prior to this experiment. 


Procedure 

While under hypnosis each S was given the following 
posthypnotic suggestion, which was generally repeated 
two or three times: 

“Now listen carefully. After I awaken you, you'll 
notice a very interesting thing occurring. Every time 
you hear the word ‘psychology’ you’ll reach up and 
scratch your right ear. You will be wide awake, and 
aware of everything that’s going on, but each time the 
word ‘psychology’ is mentioned, you’ll feel compelled 
to scratch your right ear. Do you understand that 
clearly?” 








1 Based on one portion of a dissertation submitted to 
the faculty in partial fulfillment of the requirements for 
the Ph.D. degree at the University of North Carolina, 
1952. 


During the administration of the suggestion, S was 
alone in the room with the hypnotist (hereafter referred 
to as £,). At the conclusion of the suggestion, S was 
awakened and the experiment proceeded as follows: 

Period 1. One or more experimenters (associates of 
the hypnotist who were collaborating in the study) 
entered the room and were introduced to S. A general 
discussion ensued which included S, E,, and whoever 
else was present. Each of the various Es managed to 
weave into the conversation the word “psychology” a 
number of times (a minimum of five), and also at- 
tempted, by means of indirect questioning, to get S 
himself to say the word. While the discussion was car- 
ried on in a friendly, informal, nonexperimental at- 
mosphere, it was apparent at all times that the Es 
were taking note of S’s ear-scratching behavior. 

Period 2. After the discussion had gone on about ten 
minutes, an attempt was made to alter inconspicuously 
the nature of the situation. The other Es temporarily 
dropped out of the conversation, and E£, began to chat 
with S about the posthypnotic suggestion which had 
been given him under hypnosis. The ancillary Es soon 
joined in this discussion, and during the next five or 
ten minutes every effort was made to create and 
sustain the impression that the “experiment” was com- 
pleted—i.e., that the Es had satisfactorily witnessed 
the effects of the posthypnotic suggestion, and were 
now no longer interested in ear-scratching behavior per 
se. Throughout this period, however, the stimulus word 
“psychology” was again mentioned by all the Es on 
numerous occasions. 

In order to clarify the technique used during Period 
2, the following is a truncated account of the conversa- 
tion with one S: 

E,: “Have you noticed anything unusual during the 
past few minutes, Jim?” 

S: “Why, yes, I’ve been scratching my ear every 
once in a while.” 

E,: “Uh-huh, as you may have guessed, you were re- 
sponding to a posthypnotic suggestion I gave you 
earlier. Did you notice when you were scratching your 
ear?” 

S: “It seems like it was to a word or something.” 

E,: “That’s right. I had given you the suggestion 
that every time you heard the word ‘psychology’ you 
would scratch your ear. How did you feel the first time 
you heard the word ‘psychology’?” 

S: “Well, I just found my hand reaching up.” 

E,: “Yes, posthypnotic suggestions work that way. 
All that’s necessary is to tell you while you’re in hyp- 
nosis that you’ll scratch your ear whenever you hear 
the word ‘psychology,’ and sure enough you find your- 
self doing it.” 

The conversation then continued along the same 
lines with the other Es joining in. 

Period 3. As an additional control, EZ; now en- 
deavored to restructure S’s expectancies by intimating 
that the experiment was still in progress. The hypnotist 
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* This S failed to respond the first three times the stimulus 
word was presented, and then suddenly began to respond. 
¢ This S held her hand against her ear throughout the second 


period. 


informed S that he was going to rehypnotize him and 
remove the suggestion 

“All right, Jim, in a few moments I’m going to 
hypnotize you once again and remove the suggestion I 
gave you. I’m sure you wouldn’t want to go around 
scratching your ear indefinitely. Before I put you back 
in hypnosis, though, I was just wondering . ..do you 
remember the name of the last psychology book you 
bought?” 

The other Es in the room then presented the stimulus 
word a few times (both in sentence form and as a single, 
isolated word), and finally S was directly requested to 
say the word himself. Regardless of whether or not he 
responded to the stimulus, S was subsequently re- 
hypnotized and the suggestion was permanently re- 
moved 

At a later date, ranging from one day to a week, 
those Ss who failed to respond during Period 2 were 
interviewed individually by £; alone. In this interview 
interest was focused upon S’s attempts to account for 
the differences in behavior between Periods 1 and 2. 


RESULTS 


Table 1 records the presence (+) or ab- 
sence (—) of the ear-scratching behavior in 
response to the stimulus word “‘psychology”’ as 
presented by E£, during the three periods of 
the experiment. As was readily expected, all 
thirteen Ss responded positively during Period 
1.2 Under the altered conditions of Period 2, 
however, nine Ss completely stopped respond- 
ing; two Ss (Bu and Bl) continued to respond 


2? Although most Ss were completely consistent in 
either responding or not responding during any given 
period, two Ss during Period 1 missed the cue on single 


occasions 
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in the same manner as they did in Period 1; 
and one S (Me) confused the issue considerably 
by keeping her hand against her ear through- 
out the second period, making it difficult to 
determine whether or not she was responding. 
One S (He) failed to respond in Period 2 the 
first three times the stimulus word was men- 
tioned by £,, but then abruptly resumed the 
response and continued for the remainder of 
the period. 

Table 1 further indicates that, in Period 3, 
seven of the nine Ss who ceased responding in 
Period 2 reverted to the ear-scratching be- 
havior in the absence of any intervening formal 
hypnosis. 

Of particular interest are the data pertaining 
to the variability in response consistency. That 
the instructions were interpreted individualis- 
tically by different Ss is clearly seen in an 
examination of Table 2. While ail the Ss with- 
out exception responded in Period 1 when the 
stimulus word originated from £, four Ss did 
not respond when they themselves uttered the 
word; two of these four, moreover, failed to 
respond when any of the ancillary Es pre- 
sented the cue. Table 2 shows that five Ss 
could not be induced to say the word “psy- 
chology,”’ and it is noteworthy that all Ss 
manifested a decided reluctance to say the 
word. 

One of the most striking findings was ob- 
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tained in the postexperimental interview when 
the Ss were confronted with the fact that they 
had stopped scratching their ear during Period 
2. Of the nine Ss interviewed, two (Co and Ro) 
offered elaborate ‘“‘rationalizations’”’: one said 
he noticed his ear itching severely each time 
the word was mentioned, but could not scratch 
it because his hands were in his pockets (which 
was untrue); the other claimed he was unable 
to respond because he had a book in his hands 
at the time (equally untrue). The remaining 
seven Ss insisted they did not hear the word 
mentioned during Period 2! 


DISCUSSION 


Nature of posthypnotic behavior. A compari- 
son of the behavior in the three experimental 
periods strongly suggests that the performance 
of a posthypnotic suggestion is a function of 
S’s understanding of the hypnotist’s expecta- 
tions—not only when the suggestion is given 
during the formal hypnotic state, but also in 
the process of enacting the suggestion. In 
Period 1 Ss responded each time the signal 
was perceived, and it may be safely assumed 
that this behavior would have continued if the 
hypnotist had not restructured the situation 
in Period 2. Although S was allegedly “‘wide 
awake,” obviously some aspect of the hyp- 
notist’s behavior affected S’s posthypnotic 
behavior. To the extent that the hypnotist was 
successful in manipulating the experimental 
periods, it seems reasonable to assume that 
Ss who failed to respond in Period 2 were led 
to believe that the ear-scratching behavior was 
no longer expected. While the actual suggestion 
stipulated nothing about the termination of 
the behavior, and specifically directed S$ to re- 
spond every time the word was mentioned, the 
majority of the Ss were, so to speak, “caught 
off guard.”” When the hypnotist implied at the 
beginning of Period 3, however, that the sug- 
gestion was supposed to be still operative, Ss 
tended to revert to the posthypnotic act.* 


3 Further support for the validity of this interpreta- 
tion is found in a hypnotic interview held with one S 
(He) at the conclusion of the experiment. When asked, 
under hypnosis, to account for the fact that in Period 
2 he failed to respond the first three times the word 
was presented but then abruptly began to respond 
again, after extensive questioning he stated that he 
suddenly realized that “the experiment wasn’t over 
yet.” One might hypothesize that had it been possible 
to ctructure the situation even more precisely during 
Period 2, all 13 Ss might have been “caught off guard.” 
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The fact that a posthypnotic suggestion is 
carried out only in a specific context has never 
been sufficiently recognized in the literature. 
Indeed, it has been generally implied that, how- 
ever unexpectedly a posthypnotic signal may 
arise, the good S finds himself responding 
(1, 4, 6). 

A number of studies indicate that when an 
S is given a posthypnotic suggestion without 
a terminal cue (i.e., indicating when S should 
stop responding), the behavior tends eventu- 
ally to cease as a function of time (e.g., 5, 7). 
That this behavior, however, can be “turned off 
and on” by controlling S’s expectations in the 
subsequent “waking” state seems to be a hith- 
erto unrecognized phenomenon which refocuses 
the entire problem of posthypnotic-suggestion. 
In effect, the question may be posed: Is S in 
some form of hypnotic state during the folal 
time that he is prepared to respond post- 
hypnotically, even when the behavior extends 
over a period of days without any intervening 
formal hypnosis? The answer to this question 
is certainly not a simple one, but will be 
touched on very briefly here. 

Erickson and Erickson (3) have elaborated 
on Moll’s earlier observation (6) that S fre- 
quently appears to pass into a spontaneous 
hypnotic state immediately preceding the sug- 
gested posthypnotic act; this spontaneous state 
terminates as soon as the behavior is com- 
pleted, and S resumes his ‘“‘normal”’ activity. 
The specific manifestations of the state are 
thoroughly described by the Ericksons, who 
also point out that this new state of hypnosis 
can be most clearly demonstrated by pro- 
pitiously interrupting S in the process of per- 
forming the posthypnotic act: S will now heed 
the hypnotist’s suggestions once again, al- 
though both before and after the response it is 
claimed that S is no more suggestible to the 
hypnotist’s requests than is a nonhypnotic 5S. 

In the present study the manifestations of 
the spontaneous, self-limited state were clearly 
observed*—but only on each occasion when S 
began to reach up to scratch his ear. That is, 
during the greater part of the interaction be- 
tween S and Es, there were no indications of 


4 Amnesia for the ear-scratching behavior, however, 
did not occur even in those Ss who were ordinarily 
amnesic for hypnotic events. It is probable that the 
discussion during Period 2, in which specific reference 
was made to the ear-scratching, contributed to the lack 
of subsequent amnesia. 








506 


any hypnotic-like state; whenever the stimulus 
word was effective, however, the new hypnotic 
state became momentarily evident until S 
finished scratching his ear. Now if it is assumed 
that Ss were “wide awake and normal” in be- 
tween the ear-scratching responses, then it is 
dificult to see why the behavior changed in 
Periods 2 and 3. When the hypnotist restruc- 
tured the setting at the beginning of Period 2, 
there were no indications of a hypnotic state 
at that moment; and certainly when Ss re- 
sumed the ear-scratching behavior in Period 
3, no formal hypnosis was instigated. 

The results of this study suggest, rather, 
the view that during the entire time S remained 
in the setting after being told to awaken, a 
certain degree of rapport existed between him 
and the hypnotist. This rapport was quite 
specific, it appears, in the sense that S con- 
tinued to be concerned about the hypnotist’s 
expectations regarding the posthypnotic be- 
havior previously suggested. In no other way 
did he appear hypersuggestible. 

This concern over the hypnotist’s expecta- 
tions is further seen in the denial behavior ob- 
served during the postexperimental interview 
with the hypnotist. Although the tendency for 
hypnotic Ss to rationalize a posthypnotic sug- 
gestion is commonplace, it appears that a 
hypnotic S is equally prone to rationalize his 
lack of posthypnotic behavior—particularly if 
he believes he has failed to live up to the hyp- 
notist’s expectations.’ Moreover, when one 
considers the fact that these denials and ra- 
tionalizations were obtained from some Ss as 
long as one week after the last formal hypnotic 
session, it becomes apparent that the line of 
demarcation between termination of hypnosis 
and “‘normal waking behavior’’ is a hazy one 
indeed. Some form of rapport seems to persist 
between S and the hypnotist since the defense 
mechanisms manifested by Ss reflect a degree 
of ego involvement which would not be ex- 
pected in the absence of a strong (hypnotic?) 
relationship. One rather obvious question 
comes to mind: Does S ever completely resume 
his prehypnotic, “normal” state, where he is 
no longer affected by the hypnotist? A promis- 
ing area of research might deal with the type 
and duration of the subject-hypnotist relation- 


* By implication, it follows that introspective re- 
ports of Ss who have been used in hypnotic experi- 
mentation may be of dubious validity and should be 
interpreted with extreme caution. 
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ship after conclusion of formal hypnotic 
sessions. 

Interpretation as a function of expectancy. In 
a paper by Erickson (2) it is claimed that the 
hypnotic S interprets the hypnotist’s sugges- 
tions in a concrete, literal manner. The present 
data are clearly not in line with her contention, 
since Ss do not attach any literal meaning to 
the phrase “‘every time the word ‘psychol- 
ogy’ is mentioned.” Instead, it would seem 
that whatever interpretation is given to the 
explicit suggestion depends upon what S be- 
lieves the hypnotist intended. It will be recalled 
that when the suggestion was presented during 
hypnosis, Ss did not know that additional 
people were to be later introduced. In Period 
1, after S is awakened, it is almost as though 
he has to decide “does he (the hypnotist) ex- 
pect me to respond to these people also?” The 
fact that eleven out of the thirteen Ss did re- 
spond to the ancillary Es may possibly be 
explained by noting that these Es were pre- 
sented to Ss by the hypnotist and might thus 
be considered as “associates.” The prediction 
could be made that if the hypnotist ignored, 
or in some way appeared antagonistic toward, 
the other Es, fewer Ss would respond to them 
as a source of stimulation. 

Similarly, it seems doubtful whether it oc- 
curred to Ss during the formal hypnosis (when 
they were given the posthypnotic suggestion) 
that they themselves might be asked to present 
the stimulus word. The peculiar difficulty in 
getting Ss to say the word “psychology” dur- 
ing Period 1 cannot be readily accounted for 
by the present data, but is consistent with the 
possibility that Ss were so uncertain as to the 
hypnotist’s expectations on this score that they 
were reluctant to have to commit themselves 
one way or the other. This phenomenon is an 
interesting one, and warrants further research. 


SUMMARY AND CONCLUSIONS 


The primary purpose of this study was to 
investigate the extent to which the effective 
enactment of a posthypnotic suggestion is de- 
pendent upon the nature of the setting in 
which the suggestion is carried out. On the 
basis of data obtained from thirteen hypnotic 
Ss, the results were as follows: (a) The con- 
tinued performance of a posthypnotic sugges- 
tion is a function of S’s belief that the hypnotist 
expects the behavior to occur; i.e., if S is 
“caught off guard,” the posthypnotic behavior 
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tends to disappear. Hewever, additional ma- 
nipulation of the situation by the hypnotist 
can reinstate the behavior in the absence of 
any further formal hypnotic suggestion. (6) 
The hypnotic S does not usually tend to in- 
terpret the hypnotist’s suggestions in a con- 
crete, literal manner, but rather, the eventual 
meaning appears to be determined by S’s 
inferences as to the hypnotist’s expectations. 
(c) An incidenta! finding of some interest in- 
dicates that Ss who are “‘tricked”’ into a lapse 
of carrying out a posthypnotic suggestion seem 
to find it necessary to “save face,” either by a 
complete denial of their “‘negligence’”’ or by a 
more elaborate form of rationalization. The 
tendency for hypnotic Ss to rationalize their 
posthypnotic behavior is well-established in 
the literature; it appears, however, that hyp- 
notic Ss are equally capable of rationalizing 
their lack of posthypnotic behavior. 

The results suggest that when an S is carry- 
ing out a repetitive posthypnotic suggestion 
over an extended period of time, a certain 
degree of rapport continues to exist between 
him and the hypnotist. This rapport is evi- 
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denced by the S’s unceasing concern over the 
hypnotist’s expectations in regard to the pre- 
viously suggested posthypnotic behavior. 
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SELF-PERCEPTION, PERCEPTUAL DEFENSE, AND ADJUSTMENT! 
BERNARD CHODORKOFF 


University of Wisconsin?® 


N THE personality theories of Rogers (12, 

13) and Snygg and Combs (2, 14), defensive- 

ness is described as primarily a perceptual 
phenomenon which follows as a consequence 
of threat to the individual’s self. Defense, in 
essence, is the prevention of accurate percep- 
tions of what is threatening from reaching 
awareness. As a result, aspects of the environ- 
ment and of the person himself may be denied 
to awareness or may be misperceived. It is 
in this way that the individual insures the sta- 
bility of his self. Furthermore, the adequacy 
of the individual’s personal adjustment is 
considered to be inversely related to the degree 
to which experiences are denied awareness. 

The present research was designed to study 
the implication that the degree of defensive- 
ness and its perceptual consequences will be 
the same, regardless of whether you select for 
examination the individual’s perception of his 
environment or his perception of himself, and 
that both will be related to the adequacy of 
the individual’s personal adjustment. 

The hypotheses which were set up to be 
tested may be stated as follows: 

1. The greater the agreement between the 
individual’s self-description and an objective 
description of him, the less perceptual defense 
he will show.’ 

2. The greater the agreement between the 
individual’s self-description and an objective 
description of him, the more adequate will be 
his personal adjustment. 

3. The more adequate the personal adjust- 
ment of the individual, the less perceptual 
defense he will show. 


1 This article is based on a thesis submitted in 
partial fulfillment of the requirements for the degree of 
Doctor of Philosophy at the University of Wisconsin. 
The author expresses his gratitude to Dr. E. E. Baugh- 
man for his supervision and assistance, and to the 
other members of his research committee, Drs. D. A. 
Grant and J. C. Gilchrist, for their suggestions. The 
author also expresses his appreciation to the judges who 
participated in evaluating the projective material—Drs. 
S. Epstein and H. A. Page and Miss E. Whitehouse. 

2 Now at the Veterans Administration Hospital, 
Dearborn, Michigan 

3 Perceptual defense here refers to the differential 


recognition of threatening and nonthreatening words. 


508 


METHOD 


The methodology employed is a blend of clinical 
and experimental techniques. This approach made it 
possible to obtain relevant information about several 
currently important areas of research; these include 
perception and personality theory, perceptual defense, 
the role of perception in adjustment, and the relation- 
ship between self-perception and adjustment. 

Instruments. Defensiveness in perception of the 
self was studied by means of a Q sort of self-descriptive 
statements. Disagreement between the individual’s 
Q-sort self-description and the Q-sort description of 
him based on projective material represented defensive- 
ness in this situation. 

Defensiveness in terms of perception of the environ- 
ment was studied by means of the tachistoscopic 
presentation of neutral words and threatening words 
for which the threat is personally relevant. In this 
situation defensiveness manifested itself as greater 
difficulty in recognizing the threatening words. 

An adjustment rating scale and the Munroe Inspec- 
tion Rorschach Check List (9) were used to evaluate 
the projective material. Two measures of adjustment 
were derived from these. 

Subjects. The subjects (Ss) were 30 male under- 
graduates who were taking introductory psychology 
at the University of Wisconsin. Their ages ranged from 
18 to 29 years; mean age was 21.4 years with a standard 
deviation of 1.92 years. None of the Ss had previously 
taken any of the tests used here, and none had ever 
been or was currently involved in psychotherapy or 
counseling. 

Procedure. Three sessions were spent with each S. 
The first two sessions were scheduled as close together 
as possible, and the third was held exactly one week 
following the second session. 

During the first session each S§ filled out a bio- 
graphical inventory, and the Rorschach was admin- 
istered. The biographical inventory consisted of 45 
multiple-choice and incomplete-sentence-type items. 
The items were adapted from an inventory described 
by Kelly and Fiske (5). They provided background 
information about each S. 

The second session included the administration of a 
word association test‘ and the Thematic Apperception 
Test. The word association test consisted of 100 words, 





‘ The following have been deposited with the Amer- 
ican Documentation Institute: (a) the word association 
test—Document No. 4163; (6) the Q sort—Document 
No. 4162; (c) the adjustment rating scale—Document 
No. 4161. Order by document number from the ADI 
Auxiliary Publications Project, Photoduplication 
Service, Library of Congress, Washington 25, D. C., 
remitting in advance for each one $1.25 for 35 mm. 
microfilm or $1.25 for 6 by 8 in. photocopies. Make 
checks payable to Chief, Photoduplication Service, 
Library of Congress. 
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50 emotional and 50 neutral. These were five-letter 
words selected on the basis of (a) equivalent frequency 
of usage as determined by the Thorndike-Lorge (16) 
word counts and (6) agreement between four judges 
as to the classification of each word. The S’s reaction 
time to each word was measured by means of a voice- 
key type of reaction time apparatus. 

The Rorschach test and the TAT were administered 
and treated according to the procedures outlined by 
Klopfer and Kelley (6) and Murray (10). 

The perceptual defense procedure was administered 
one week after the word association test. On the basis 
of the reaction times obtained on the word association 
test, two groups of ten words each were selected for 
each S to be presented as the stimuli for the perceptual 
defense procedure. One group consisted of the emo- 
tional words that had the highest reaction times; the 
other group consisted of the neutral words with the 
lowest reaction times. In no case was a word selected 
if S gave indication of not knowing its meaning. In this 
way it was possible to select for each S those words 
which represented personally relevant threatening and 
neutral stimuli. 

Three practice words were presented tachistoscop- 
ically to S, and these were followed by the 20 selected 
words presented in random order. Each word was 
presented in the tachistoscope beginning at the .01-sec. 
exposure level, and the exposure time was increased 
consecutively by .01 second until S was able to give 
two consecutive, correct reports of the stimulus word. 
The exposure time at which he correctly reported the 
word for the second time was used as the recognition 
score for that word. 

Immediately following the perceptual defense pro- 
cedure S was given the Q-sort cards with instructions 
to sort them so that they described himself. This Q sort 
consisted of 125 short self-descriptive statements,' e.g., 
“T am impulsive”; “I usually ask for help if I can get 
it.” 

It was E’s purpose to have the Q sort contain items 
about which information could most readily be ob- 
tained from the Rorschach and TAT, and which would 
describe most people to some degree. An attempt was 
made to have the final sample follow the principle of 
randomization (15); that is, the items should interact 
in S’s judgment about them, but each item should not 
be contingent upon any other one. This fina] Q sort 
was developed with the cooperation of four clinically 
trained persons. 

The following Q-sort distribution was employed: 


Frequency: 3 5 7 10 13 16 17 16 13 107 5 3 
Scale value: 121110 9 8 7 6 5 4 3210 
N = 125 


Three judges and E participated in the task of 
evaluating the projective and biographical material ob- 
tained from Ss. The judges were selected on the basis of 
their clinical experience and ability to deal with pro- 
jective data. Each judge worked on ten cases; these 
were assigned in consecutive order as the material 
became available. The £, as the fourth judge, teamed 
up with each of the other judges in evaluating the 
material, and, therefore, participated in evaluating all 
30 Ss. 





5 See footnote 4. 
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At the end of the second and before the third session, 
one of the judges was given the following material: the 
biographical inventory, the Rorschach protocols with 
the scores summarized, the word association test data, 
and the TAT protocols. 

The judge and E worked up their independent 
interpretations of the data, and then met to discuss 
them. After the two reached some agreement about 
their evaluation of the material, they sorted the Q-sort 
items to describe S as they saw him.*® 

The judge and E independently rated S on an ad- 
justment rating scale’ which consisted of 11 subscales. 
A final rating for S on each of the subscales was then 
agreed upon. The content of each scale was derived 
primarily from Maslow and Mittleman (8, ch. IV). A 
descriptive paragraph was provided for the extreme 
ratings of each scale so that these might serve as 
anchoring points for the judges. 

After the judges completed their work, E applied 
the Munroe Inspection Rorschach Check List to Ss’ 
Rorschach protocols. 

Measures and statistics. Three basic scores were 
derived from the data. The first was the accuracy of 
self-description (ASD) score, which was obtained in the 
following way: 

The judges’ Q-sort description of S was considered 
the objective description of him. The S’s Q-sort self- 
description was correlated with the judges’ Q-sort 
description. The product-moment r obtained was trans- 
formed to Fisher’s z, and this score was used as the 
ASD score. The higher the ASD score, the greater the 
agreement between the objective description of S and 
S’s self-description. 

The second basic score was the perceptual defense 
(PD) score. Here two scores were derived. One was the 
difference between the mean recognition threshold for 
neutral words and the mean recognition threshold for 
threatening words (Mt — Mn); this was called the 
PDm score. The other was the difference between the 
standard deviation of the distribution of recognition 
thresholds for the neutral words and the standard 
deviation of the distribution of recognition thresholds 
for threatening words (ot — on); this was called the 
PDs score.® 

The third score was the adjustment (A) score. Here 
again, two measures were derived, one based on the 
Munroe check list and the other derived from the 
judges’ ratings of adjustment. The check-list adjust- 
ment (Ac) score was the number of checks S received. 





6 This procedure was suggested by Dr. Julius See- 

nan, Counseling Center, University of Chicago. 

7See footnote 4. 

8 In the present study it was hoped that the possi- 
bility of the occurrence of response suppression in the 
perceptual defense situation, and intentionally pro- 
duced inaccurate self-description in the Q-sorting pro- 
cedure, would be minimized by using only male Ss and 
by conducting the experimental procedures in an 
atmos;:here which would not inhibit Ss. It is doubtful 
that these did occur to any significant degree especially 
since the sequence of the procedures would also tend to 
lessen the possibility of inhibition. However, if these 
reactions did occur, then these should also be considered 
as defensive reactions. 
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The rating scale adjustment (Ar) score was derived 
by converting each S’s rating on each scale into a 
standard score based on the distribution of the 30 
ratings for that scale. For each S, the Ar score was the 
sum of his standard scores on the 11 subscales. Poor 
adjustment was indicated by a high check-list score, 
or by a low rating on the adjustment rating scale. 


RESULTS 


Table 1 presents the correlations that were 
obtained between the basic scores. It will be 
noticed in this table that in addition to pro- 
duct-moment correlations, biserial and mul- 
tiple correlations were also employed. Pro- 
duct-moment correlations were not used with 
the Ar scores because these scores were based 
on values obtained from a rating scale on 
which the equality of units was questionable. 
The group of 30 Ss was divided into good and 
poor adjustment groups by using the median 
as the dividing point. Biserial r was then 
appropriate (3). Multiple correlations were 
used because of the possibility that a combina- 
tion of the two measures of adjustment would 
give a better estimate of S’s adjustment level 
than would either measure alone. 

The findings are listed below as they relate 
to their respective hypotheses: 

Hypothesis 1. Both the PDm and PDs scores 
correlated negatively and significantly with 
the ASD scores. These findings support the 
first hypothesis. The more accurately the 
individual described himself, the less percep- 
tual defense he showed. 

Hypothesis 2. The Ar scores correlated posi- 





TABLE 1 
SUMMARY OF CORRELATIONS 
Hy- Type of 
poth- orre- 
esis Basic Scores Correlated lation Value 
1 ASD with PDm r — .53** 
ASD with PDs r — .44* 
2 Ac with ASD r — .34 
Ar with ASD ry .73** 
Ar and Ac with ASD R .43°° 
3 Acwith PDm r .29 
Ac with PDs r Dn 
Ar with PDm a — .62** 
Ar with PDs r — .68** 
Ar and Ac with PDm R — .62** 
Ar and Ac with PDs R — .68** 


Note.—Product-moment correlation has been indicated by r. 
Siserial correlation has been indicated by r and multiple correla- 
tion by"R 

* Significant at the .0S level of confidence. 

** Significant at the .01 level of confidence. 
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tively and significantly with the ASD scores. 
Although the Ac scores did not correlate sig- 
nificantly with the ASD scores, the correla- 
tion nevertheless approached significance in 
the predicted direction. A combination of the 
Ar and the Ac scores correlated positively and 
significantly with the ASD scores. In general, 
these results support the second hypothesis. 
The better the adjustment of the individual, 
the more accurate was his self-description. 

Hypothesis 3. The Ar scores correlated nega- 
tively and significantly with the PDm scores. 
The correlation between the Ac and PDm 
scores, although not significant, was in the 
predicted direction. A combination of the Ar 
and the Ac scores correlated negatively and 
significantly with the PDm scores. 

The Ar scores correlated negatively and 
significantly with the PDs scores. The Ac 
scores correlated positively and significantly 
with the PDs scores. A combination of the Ar 
and the Ac scores correlated negatively and 
significantly with the PDs scores. These re- 
sults confirm the third hypothesis. The more 
adequate the individual’s personal adjustment, 
the less perceptual defense he showed. 

In addition to the correlations listed in 
Table 1, the two measures of adjustment were 
correlated. A biserial r of —.56 was obtained 
and is significant beyond the .05 level. 

It should be noted that the adjustment 
scores that were used were not independently 
obtained criterion measures. The judges’ 
Q-sort descriptions of Ss were based on the 
same projective material from which both 
measures of adjustment were derived. It would 
have been advantageous to have obtained 
an independent evaluation of adjustment, 
but unfortunately this was not feasible. Con- 
sequently, the possibility that the correlations 
of the Ar and Ac scores with the ASD scores 
are somewhat inflated must be considered. 
The correlations involving the PD scores are 
not open to the same criticism, however. 


DISCUSSION 


Adjustment and the handling of perceived 
threat. An interesting finding became evident 
when the individual scores were perused. Of 
the 30 Ss, 11 recognized the threatening words 
more quickly than the neutral words. These 
Ss appear to be the sensitizers spoken about 
in other perceptual defense studies (1). Of 
these 11 Ss, 10 belong to the group of 15 Ss 
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who have the highest ASD scores. Also, of 
these 11 sensitizers, 8 are included in the group 
of 15 Ss who have the highest Ar scores. When 
these results are considered with the major 
findings of this study, the following formula- 
tion is suggested: 

The better adjusted individual attempts to 
obtain mastery over threatening situations by 
getting to know, as quickly as possible, what 
it is that is threatening. He may be alerted to 
the possibility of threat and may be influenced 
by a set which leads him to try to differentiate 
and symbolize aspects of himself and his en- 
vironment which are potentially threatening. 
The less adequately adjusted individual, in 
contrast, may be thought of as influenced by 
a set to keep threat inadequately differen- 
tiated and inadequately symbolized. 

Furthermore, it can be argued that adequate 
behavior depends upon adequate perception. 
If an individual cannot differentiate and sym- 
bolize aspects of himself and his environment, 
his behavior in turn will not be adequate. He 
will also be unable to resolve threatening situa- 
tions when they confront him, for if he does 
not recognize threat, how can he be expected 
to deal with it effectively? 

It may be that for the well-adjusted person 
threat is a relatively momentary phenomenon. 
He is only threatened initially; then he extends 
himself in order to handle the threat. As a 
result, the threat is minimized. For the mal- 
adjusted person, threat persists because he 
cannot deal with it directly. Instead he tries 
to remove himself from it by denial or misper- 
ception. 

Perceptual defense an individual, not a 
general, reaction. The concept of perceptual 
defense, an unconscious mechanism of resis- 
tance to recognition of threatening material, 
has frequently been used to explain the higher 
recognition thresholds for taboo words than 
for neutral words (1, 7). Whether or not such 
a mechanism is necessary has been questioned 
by some workers in the field (4, 11). 

In the present study, no difference was 
found between the mean recognition thresholds 
for threatening and nonthreatening words. 
The ¢ test was used to evaluate the difference 
between the means (Mt = 15.11, ot = 7.62; 
Mn = 14.35, on = 8.91), and a value of ¢ 
= .75 was obtained. This is obviously not 
significant and indicates that perceptual de- 
fense did not occur for the group as a whole. 


Sil 


The matter cannot be dismissed at that, how- 
ever, for meaningful differences have been 
found in terms of individual reactions. The 
importance of these differences has been in- 
dicated by the major findings of this study. 
These findings suggest therefore that percep- 
tual defense is not a general phenomenon but 
may be a process which some individuals are 
more apt to manifest than others. 

Importance of using individually selected, 
personally relevant stimuli. Consideration must 
be given to the possibility that there are 
different kinds of threat. Taboo words like 
whore and bitch may present threat in the 
sense of social prohibition which is uniformly 
imposed on most people; these words there- 
fore, can generally be expected to produce 
elevated recognition thresholds. Other words 
like mammy and haven, if threatening to the 
individual, may represent more basic psy- 
chologic threat, and may be more intimately 
related to the person’s personality trends. 
The handling of such threatening material is 
much more of an individual matter. 


SUMMARY 


The present research was designed as an 
attempt to verify the following hypotheses: 

1. The greater the agreement between the 
individual’s self-description and an objective 
description of him, the less perceptual defense 
he will show. 

2. The greater the agreement between the 
individual’s self-description and an objective 
description of him, the more adequate will 
be his personal adjustment. 

3. The more adequate the personal adjust- 
ment of the individual, the less perceptual 
defense he will show. 

The results in general supported all hypoth- 
eses. The following can be concluded: 

1. In a group of Ss who show varying degrees 
of adjustment and defensiveness, one finds 
that the more inaccurate and faulty the in- 
dividual’s perception of his environment, the 
more inaccurate and faulty is his perception 
of himself; and the more inaccurate and faulty 
the individual’s perceptions of himself and 
his environment, the more inadequate is his 
personal adjustment. 

2. Perceptual defense is meaningful when it 
is considered as an individual phenomenon 
rather than a general or universal one. Im- 
portant differences exist between persons and 
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apparently can be demonstrated if personally 
relevant, individually selected stimuli are 
used. 


REFERENCES 


. Bruner, J. S., & Postman, L. Emotional selec- 
tivity in perception and reaction. J. Pers., 1947, 
56, 69-77. 

. Comps, A. W. A phenomenological approach to 
adjustment theory. J. abnorm. soc. Psychol., 
1949, 44, 29-35. 

. Cronsacu, L. J. Statistical methods applied to 
Rorschach scores: a review. Psychol. Bull., 1949, 
46, 393-429. 

. Howes, D. H. & Sotomon, R. L. A note on Mc- 
Ginnies’ ‘“Emotionality and perceptual defense.” 
Psychol. Rev., 1950, 57, 229-234. 

. Ketry, E. L., & Fiske, D. W. The prediction of 
performance in clinical psychology. Ann Arbor: 
Univer. of Michigan Press, 1952. 

. Kioprer, B., & Kerrey, D. M. The Rorschach 
technique. Yonkers, N. Y.: World Book Co., 1946. 

. McGrynies, E. Emotionality and _ perceptual 
defense. Psychol. Rev., 1949, 56, 244-251. 


8. 


9. 


Mastow, A. H., & MitTTLeMAN, B. Principles of 
abnormal psychology. New York: Harper, 1941. 
Monroe, Ruts L. Prediction of adjustment and 
academic performance of college students by a 
modification of the Rorschach method. Appl. 

Psychol. Monogr., 1945, No. 7. 


. Murray, H. A. Thematic Apperception Test. 


Cambridge: Harvard Univer. Press, 1943. 


. PostMAN, L., Bronson, W. C., & Gropper, G. L. 


Is there a mechanism of perceptual defense? J. 
abnorm. soc. Psychol., 1953, 48, 215-223. 

. Rocers, C. R. The organization of personality 
Amer. Psychologist, 1947, 2, 358-368. 


. Rocers, C. R. Client-centered therapy. New York: 


Houghton Mifflin, 1951. 

. Snyoc, D., & Comss, A. W. Individual behavior: a 
new frame of reference for psychology. New York: 
Harper, 1949. 

. STEPHENSON, W. Some observations on Q tech- 
nique. Psychol. Bull., 1952, 49, 483-498. 

. THORNDIKE, E. L., & Loree, I. The teacher’s word 
book of 30,000 words. New York: Bureau of 
Publications, Teachers Coll., Columbia Univer., 
1944. 


Received June 25, 1953. 





AN EXPERIMENTAL INVESTIGATION OF THE EFFECT OF UNSTABLE 
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N ORDER to work together, live together, 
or even to maintain the barest degree of 
social contact with one another, it is 

necessary for persons to share ideas, to com- 
municate attitudes and feelings and informa- 
tion. While much communication is intended 
as such, some of the communication which 
occurs among persons may be termed “in- 
direct” or “inadvertent” communication. This 
occurs when information is conveyed to 
others, not by actually talking about it, but 
by behaving in such a way that someone else 
can draw inferences. 

There are some kinds of things about 
which communication is almost never direct 
but is typically inadvertent. Among adult 
persons of the same sex, for example, it is 
quite rare to have direct communication 
about how much they like each other. This 
type of direct communication about social 
preferences does go on among children, but 
among adults it would generally be embar- 
rassing. People do, however, behave differ- 
ently toward those they like and toward those 
they dislike and in this manner achieve com- 
munication about these feelings. Certainly, 
among people who interact with one another, 
there is knowledge of how cthers feel about 
them even though there may never have been 
any direct communication on the subject. 

How much knowledge persons in a group 

1 The collaboration on this study was made possible 
by a grant from the Rockefeller Foundation. Initial 
planning of the study was carried out jointly by the two 
authors in Ann Arbor, Michigan. After preliminary ex- 
perimentation was done, the final design was agreed 
upon during conferences in Leiden, Holland. The actual 
experimental work in Holland was done at the Neder- 
lands Instituut voor Praeventieve Geneeskunde with 
funds provided by the Nederlandse Organisatie voor 
Zuiver Wetenschappelijk Onderzoek. In the United 
States it was done at and supported by the Laboratory 
for Research in Social Relations of the University of 
Minnesota. The authors wish to express their apprecia- 
tion and indebtedness to Aart Hazewinkel and Els v. d. 
Schoot, who helped conduct the experiment in Holland, 
and to Jay Jackson and Robert Hall, who helped in the 
United States. 


have concerning how others feel about them 
is determined by two factors, assuming a 
complete absence of any direct communication 
on the subject. One factor is the amount of 
difference between behavior toward those who 
are liked and toward those who are disliked. 
The other factor is how sensitive others are to 
these differences in behavior. In a group that 
behaves very formally and strictly according 
to some rules of etiquette and politeness, for 
example, we would expect the differences in 
behavior toward those liked and disliked to be 
minimal. We would not, consequently, expect 
to find very accurate knowledge in such a 
group concerning likes and dislikes. Also, of 
course, no matter how large the differences in 
behavior directed toward different persons 
might be, we would expect little accurate 
knowledge if, for one reason or another, 
people paid no attention to, or were not sensi- 
tive about, these cues. 

This latter factor of sensitivity is maxi- 
mally important in inadvertent communica- 
tion and minimally important in direct 
communication. In direct communication the 
person must only be listening in order to have 
the communication impinge upon him. In 
inadvertent communication, however, this 
factor of sensitivity may make a great dif- 
ference. 

The present experiment attempts to study 
the effect of the stability of the interpersonal 
relations within the group on the accuracy of 
perception of social preference. It may be 
expected that persons whose relations with 
others in a group are stable may be different 
from those whose relations are unstable with 
respect to both sensitivity and the cues they 
make available concerning their feelings for 
others. 

It is possible that those whose relations in a 
group are stable, because of their greater 
security, would be less cautious in their 
behavior and, consequently, provide more 
cues from which others could infer their likes 
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and dislikes. One might also expect that those 
whose relations in the group are unstable 
would be more sensitive to the cues available 
because they would be more concerned with 
finding out how others feel toward them. The 
experiment was consequently designed so that 
half the people in each group were put in a 
situation where they felt secure and stable 
about their interpersonal relations in the 
group while the other half were made to feel 
unstable about these relations. After they had 
had a period of time in which to interact 
with one another, measurements were made 
from which one can calculate how accurate 
their knowledge was of how others felt about 
them. From observation of the interaction 
among thern one can calculate whether there 
was any difference between Ss in the two 
conditions in behavior toward others. 

The experiment was conducted simultane- 
ously in two different countries, Holland and 
the United States. The same number of 
groups was run and the identical procedures 
were used (within the limits of translation of 
language between English and Dutch) in 
both countries. The comparison of the results 
in the two countries can provide us with some 
evidence concerning how independent the 
findings are of cultural differences, at least 
between these two countries. 


METHOD 


Subjects. In each of the two countries data were 
collected on 40 experimental groups, each group con- 
sisting of six subjects (Ss) of the same sex. Twenty 
male and 20 female groups were run in Minneapolis; 28 
male and 12 female groups were run in Holland. All Ss 
were undergraduate students at the University of 
Minnesota, Leiden, or Amsterdam. The Ss scheduled 
for any particular group were always strangers to one 
another. For the purposes of this experiment stranger 
was defined as not knowing the other person’s name. 
Occasionally there were persons in the same group who 
knew each other by sight but did not know each other’s 
name. In Holland there were occasional deviations from 
this rule. Persons who knew each other’s name were 
sometimes in the same group if the experimenter (£) 
felt sure that their acquaintance was only superficial. 

Procedure. The Ss, upon their arrival, were seated in 
a U-shaped arrangement. In the United States the 
observers (Os) were located behind a one-way observa- 
tion mirror which faced the open end of the U. In 
Holland one O was sometimes behind a one-way ob- 
servation mirror and sometimes in the room with Ss. 
The other Os were always in the room with Ss. In this 
way it was possible for Os to see all the faces of the 
persons in the group. A letter from A to F was placed 
in tront of each S for identification purposes. 


LEON FESTINGER AND HERMAN A. HouTTreE 





The Ss were asked to introduce themselves to each 
other by telling their names, what part of the country 
they came from, and what they were studying. This was 
the only information any of the Ss had about the others 
at the start of the experiment. 

The E then verbally gave the following instructions: 

This aiternoon you will have a discussion 
among the six of you in which you will make 
up a plot for a radio play. Before we start doing 
that it is, however, necessary for the purposes 
of this experiment for us to know how you feel 
about each other. After the experiment I will 
explain to you why we do this. We would like 
you to write down on a slip of paper which will 
be passed out to you in a minute how well you 
like each of the other persons in this group. On 
this slip we would like you to write down the 
letters of each of the other persons in your 
group alongside that statement that best ex- 
presses your feelings toward each other. 

The following slip was then passed out: 


My Letter is ___ 
. likes me exceptionally well — 
. likes me quite well 
likes me moderately 
. likes me a little 
. rather indifferent 
. dislikes me a little 
. dislikes me definitely 
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As soon as the slips had been passed out, E continued: 

Some of you may be a bit doubtful about 
these “first impressions” of people. I can, how- 
ever, assure you that we have found in the 
past, in many studies we have done, that first 
impressions of people are extremely im- 
portant and surprisingly valid. For example, 
it frequently happens that spontaneous lasting 
friendships are formed which correspond 
exactly to such first impressions. Also, spon- 
taneous dislikes sometimes occur between 
people who met for the first time which prove 
to be quite stable. We would like you, then, to 
take your time in filling out these slips so that 
you can do it seriously. 

The last instruction was given in an attempt to 
make Ss take these initia] ratings seriously. Since the 
adequacy of the experimental manipulation depended 
to a great extent on how important Ss felt these initial 
ratings to be, this last instruction was put quite force- 
fully. After Ss had finished filling out the slips, they 
were also asked to encircle the letters of the two persons 
in the group whom they liked best. The slips were then 
collected and E continued: 

We would now like you to estimate how 
each of the others rated you a moment ago. For 
that purpose a slip of paper will now be passed 
out to you. Will you put the letter of each 
other person in the group alongside that 
statement which describes the way you think 
that person feels toward you. 

After this second slip, similar in form to the first slip, 
had been filled out and collected, E went on: 

We will now give you some information 
about how the two that you like best in the 
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group feel about each other. It is possible to 
give you this information because while you 
were filling out the second slips, Mr. 
has been tabulating the first slips. We will now 
pass out to each of you a slip which will tell 
you how those two which you said you liked 
best feel about each other. 
The slips containing this information were then dis- 
tributed. The actual information given to Ss at this 
point was fictitious and constitutes the experimental 
manipulation. We will describe this in detail in the 
next section. The E then said: 

We are now ready for you to start your 
discussion about the radio play. We will pass 
out to you a plot which is unfinished and we 
would like you to discuss among yourselves 
how you think it should end. You are com- 
pletely free to decide upon any solution of the 
problems in the plot. Keep in mind a short 
scene lasting about ten minutes. I will now pass 
out copies of the unfinished plot to each of you. 
Please read it carefully and let me know when 
you are ready to start the discussion. 
following plot was then passed out: 

Henry, the son of a physician, has a friend, 
Jim, who is under the care of Henry’s father. 
Henry knows that Jim is incurably ill. Both 
are in love with a girl, Ellen. Jim doesn’t know 
what kind of disease he has; neither does Ellen 
know he is incurably ill. One night Henry calls 
on Ellen just after he had decided to give up 
his studies and accept a job in California. He 
intends to ask her that night to marry him and 
go with him to California. Henry knows that 
for many years Ellen has wanted to go to live 
there. Before he gets a chance, however, to tell 
her, Ellen announces her engagement to Jim. 
What is Henry going to say and do and what 
will happen? 

After all six Ss had finished reading the plot, E 
started the discussion, telling Ss that he himself would 
sit at the side of the room and would not participate. 
The discussion went on for 20 minutes during which 
time Os recorded certain aspects of the communication 
process. These observations will be described in detail 
later. When 20 minutes had gone by from the time 
the discussion started, E said: 

We now have to interrupt your discussion 
for a moment because we want you to repeat 
what you did at the beginning, that is, to in- 
dicate again on the slips of paper to be passed 
out to you how you feel about the others in the 
group. 

Slips identical in form to the first slips were then 
passed out. After they were filled out by Ss and col- 
lected, E continued: 

And we also would like you to again 
estimate how the others in the group rated you. 
You see, one of the main purposes of the 
experiment is to see how likes and dislikes are 
formed and how well people know how others 
feel about them. So please give this your full 
attention and think it over well before you 
write down your estimates. 

After these slips had been distributed, filled out, and 
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collected, Ss were given a questionnaire to answer. We 
will describe the exact questions asked as we deal with 
them in discussing the results. The Ss were then told 
that the experiment was over and were given a full 
explanation of what had been done and why. 

Experimental manipulation of stability of inter personal 
relations. When Ss were given information as to how 
the two persons they liked best? felt about each other, 
this information was systematically fabricated to 
produce conditions of stability or instability of inter- 
personal relations. In each group, three Ss were told 
that their two top choices liked each other (one rating 
of 6 and one of 7), while the other three were told that 
their top choices disliked each other (one rating of 1 
and one of 2). The former information was intended to 
make Ss feel that their relations in the group were 
comfortable in that those persons whom they liked best 
were also compatible with one another. This condition, 
which we will call the compatible condition, should 
have the effect of stabilizing their preferences for others 
and should make them feel relatively secure about their 
personal relations. The condition in which Ss were told 
that their top choices disliked each other was intended 
to have the opposite effect. Here it was felt that the 
knowledge of the incompatibility between the two 
people they liked best would make them feel uncom- 
fortable about their relations with others in the group. 
This condition, which we will call the incompatible 
condition, should make Ss relatively uncertain about 
their interpersonal relations in the group and should 
make their preferences for others unstable. Such rela- 
tions among three persons have also been discussed by 
Heider (1) in a manner similar to the present discussion. 
We will have more to say later concerning the relevance 
of some of our findings to Heider’s theory. 

The Ss were put at random in either the compatible 
or the incompatible condition. Prearranged sequences 
were followed which systematically varied condition 
by position around the table. The Ss were in one or 
another condition depending upon the particular posi- 
tion sequence predesignated for that group. 

Observation of the group discussion. During the 20- 
minute discussion, observations were made of who 
talked, for how long, and to whom. In both countries 
the recording of the observations was done on a moving 
tape so that the duration of any communication could 
be computed afterwards. The Os had to indicate which 
person spoke and when he stopped speaking. Direction 
of communication was indicated by either writing down 
the letter of the person addressed or indicating, where 
appropriate, that the remark had been addressed to 
the group as a whole. 

For the data collected in Leiden, each group was 
observed by three persons. The records of one of the 
Os was used for the analysis. His record was modified 
if both other Os disagreed with him. The reliability of 
observation of who spoke to whom was .84 computed 
by product-moment correlation. For the data collected 
in Minneapolis, two Os were present for only 10 of the 
40 groups. On 66 per cent of all items recorded the two 
Os agreed both as to who was talking and to whom the 





* For the sake of ease of expression, we will refer to 
the two persons who are most liked as the two top 
choices. 
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communication was directed. Most of the disagree- 
ments involved one O’s saying that the remark was 
directed to some specific person while the other recorded 
it as having been directed toward the group. The data 
used in analysis are those observations made by that O 
who observed all of the groups. 

Questionnaire data. Data collected on slips which Ss 
filled out during the experiment and also data collected 
on the postexperimental questionnaire can both be con- 
sidered questionnaire data. There follows a summary of 
the data thus collected: 

1. Ratings of how well Ss like the other members of 
the group both before and after the discussion. 

2. Estimates of how well each of the others like 
them both before and after the discussion. 

3. Estimates of how much Ss’ top choices like each 
other after the discussion. 

4. Perceptions of how much Ss talked to others and 
others talked to them. 


RESULTS 


Adequacy of the experimental manipulation. 
It will be recalled that the experimental 
manipulation consisted of an attempt to have 
some persons think that the two members of 
their group whom they liked best liked each 
other (compatible condition), while others 
were given the impression that their two top 
choices disliked each other (incompatible con- 
dition). In order to demonstrate how ade- 
quately this was accomplished, it is not 
sufficient to show that Ss believed what they 
were told. The course of the experiment, in- 
cluding the discussion, might present other 
evidence to them. The adequacy of the ma- 
nipulation can only be judged in terms of 
whether, at the conclusion of the experiment, 
they still retained the impression which they 
were given. 

In order to gather these data, Ss were 
asked in the postexperiment questionnaire to 
state once more which two members of the 
group they had liked best at first. They were 


TABLE 1 
JupGMENtTs oF How WELL THE Two INITIALLY BEsT 
Likep Persons Like Eacu OTHER AFTER THE 
EXPERIMENT 


Minneapolis Leiden 
Condition 
Mean Mean 
Rating* Rating* 
Compatible 120 3.4 120 3.3 
Incompatible 118f 2.1 120 1.9 


* Scale from 1 (they dislike each other) to 4 (they like each 
other very much 


t Two Ss did not answer the question 
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then asked, “What is your best judgment of 
how those two feel about each other now?” 
They were given an opportunity to check one 
of four possible answers on a scale from (1) 
they dislike each other to (4) they like each 
other very much. Table 1 presents the results 
of the answers to this question. 

It is clear that in both countries the manip- 
ulation was successful. The Ss in the com- 
patible condition judged that their top choices 
like each other much more than did Ss in the 
incompatible condition. In every one of the 
80 groups the average for Ss in the compatible 
condition was higher than for Ss in the in- 
compatible condition. We may then conclude 
that the difference is highly significant. It is 
also interesting to note that the mean values 
for each of the two conditions are very much 
alike for both countries. 

Let us now turn our consideration to the 
indirect effects which it was hoped the manip- 
ulation would have. Our hypothesis was that 
Ss in the incompatible condition would be 
less stable in their liking of others than Ss in 
the compatible condition. Because they had 
been given the impression that their top 
choices liked each other, Ss in the compatible 
condition would find themselves in a stable 
interpersonal situation, and we would not 
expect many of them to change their prefer- 
ences during the experiment. On the other 
hand, Ss in the incompatible condition who 
had the impression that their top choices 
disliked each other, would feel uncomfortable 
about the interpersonal situation, and we 
would expect many of them to change their 
feelings about at least one of their two best- 
liked persons during the experiment. Of 
course, there are many other factors aside 
from this one that could induce people to 
change their preferences, and so, if the above 
hypothesis is correct, we can only hope to 
demonstrate a difference between the condi- 
tions. Table 2 presents the data on the num- 
ber of persons in each condition who changed 
at least one of their top two choices. 

It can be seen that those in the incompatible 
condition do indeed change at least one of 
their two top choices much more frequently 
than do those in the compatible condition. 
Again, the data from both countries are very 
similar although there is a slightly larger dif- 
ference in the Leiden data than in the Min- 
neapolis data. When we evaluate the signifi- 
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TABLE 2 


NUMBER OF PERSONS CHANGING Two Top CHOICES 
DuriInG EXPERIMENT 


Number Changing 
at Least One of 
Their Two Top 

Choices 


Number not 


Condition Changing 


Mpls. Leiden Mpls. Leiden 
Compatible 66 53 54 67 


Incompatible 96 94 22 26 


cance of the differences on the basis of 
consistency from group to group, we find the 
differences to be highly significant. The number 
of groups in which more incompatibles than 
compatibles changed at least one of their two 
top choices is 25 and 26 for Minneapolis and 
Leiden respectively. The number of groups in 
which the direction of difference was reversed 
was 7 and 4 for Minneapolis and Leiden 
respectively. The same number changed in the 
other 18 groups. Tested by chi square (ignoring 
the equal groups) the difference is significant 
beyond the .01 level of confidence in each 
country. 

The data do corroborate the hypothesis, and 
we may conclude that in both countries, one 
of the effects of the manipulation was to 
produce a tendency for the incompatibles to 
change at least one of their two top choices 
during the experiment. This result is also 
corroboration of Heider’s (2, 3) theory con- 
cerning such relationships. 

If it is true that many Ss in the incompatible 
condition are changing their two top choices 
because of the perception that those two dis- 
like each other, we would expect them to 
perceive greater liking between those they 
finally choose as the ones they like best. 
Among the Ss in the compatible condition, 
however, changes which occurred would have 
been for any of a variety of other reasons. We 
would then expect no such change in perception 
of how the two whom they liked best after the 
experiment like each other. 

In the questionnaire at the conclusion of the 
experiment Ss were asked to state which two 
in the group they mow liked best and also to 
state their best judgment of how these two 
now felt about each other. These data are 
presented in Table 3. 

For those in the compatible condition, the 
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changes in average perceived liking between 
their two top choices are negligible. For Ss 
in the incompatible condition the shifts in 
their two top choices are significantly toward 
persons who are perceived as liking each other 
better. Looking at consistency from group to 
group we find that in 32 of the Minneapolis 
groups and in 33 of the Leiden groups the 
change in perceived compatibility from the 
initial top choices to the final top choices was 
greater in the incompatible than in the com- 
patible condition. In only four groups (all in 
Minneapolis) was the direction reversed. We 
thus have confirmation that, in the incom- 
patible condition the experimentally produced 
belief that top choices did not like each other 
was a major factor in producing changes in 
preferences. 

The relation between social preference and 
frequency of communication. During the course 
of the 20-minute discussion, observations were 
made of who spoke, for how long, and to whom 
the communications were directed. We may 
look at these data to determine whether the 
direction of communication was affected by 
the preferences which persons had for the 
others in the group. To the extent that there 
are differences in how often members spoke 
to those they liked, to those they disliked, or 
to those they were indifferent to, we may say 
that cues were present during the discussion 
from which these preferences could be judged. 
The data on the average length of each 
communication do not show any clear trends. 
Our presentation will consequently deal with 
the frequency of communication, that is, how 
often communications were addressed to 
certain others in the group. 

In examining the relation between prefer- 
ences for others and number of communica- 
tions addressed to them, it is possible to look 


TABLE 3 
PERCEIVED PREFERENCES BETWEEN Two Top CHOICES 
BY THOSE WHO CHANGE AT LEAST ONE OF THE Top 
CHOICES 


Judgment of How Well Two Top 
Choices Like Each Other 


Condition Minneapolis Leiden 


Final 
Two 


Initial 
Two 


Final 


Two 


Initial 
Two 

2.9 

2.8 


Compatible 3 
Incompatible 


a . A 
_ : 8 
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at the data in relation to initial preferences 
(ratings made at the beginning of the dis- 
cussion) or in relation to the final preferences 
(ratings made after the discussion). In general, 
the same trends are reflected in both ways of 
examining the data. Since our primary 
objective is to determine whether or not cues 
were present from which members of the group 
could judge how others felt about them, and 
since the accuracy of this judgment is in terms 
of the preferences stated after the discussion, 
it makes more sense to look at the relation 
between frequency of communication and final 
preferences. We shall, therefore, confine our 
discussion here to this relationship. Table 4 
presents the relevant data. 

There is clearly a relationship between the 
rating given to a person (how much one likes 
him) and the number of communications 
addressed to that person. But this relationship 
is not monotonic. For both the compatible and 
incompatible conditions, both in Minneapolis 
and in Leiden, the same type of relationship is 
obtained. The fewest number of communica- 
tions are addressed to those rated 3 (indif- 
ferent). The frequency of communication to 
those rated 1 or 2 (dislike) is considerably 
greater, and it also increases on the other side 
of the continuum. More and more communica- 
tions are addressed to people the better they 
are liked. The differences between the fre- 
quency of communication to those rated 3 
and to those rated 6 or 7 are significant at 
at least the .05 level for each condition in each 
country. The differences between the number 
of communications addressed to those rated 3 
and to those rated 1 or 2 are also significant at 
beyond the .05 level for each condition in each 
country. We may conclude that the amount of 


TABLE 4 
MEAN NUMBER OF COMMUNICATIONS ADDRESSED TO 


OTHERS ACCORDING TO FINAL RATINGS OF 
PREFERENCE 
Mean Number of Com- 
munications Addressed 
Condition to Those Rated 

lor2}3|4/;5/|6)|7 
Compatible (M) 4.1 1.51.61.93.44.6 
Compatible (L) 7.1 2.53.43.74.04.8 
Incompatible (M) 3.0 1.42.23.23.75.7 
Incompatible (L) 4.1 2.23.34.45.17.2 
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communication was an available cue which 
could be used by the members of the group to 
judge how others felt about them and that 
these cues were the same, roughly, in both 
countries. It is likely that the communications 
addressed to those rated 1 or 2 were qualita- 
tively different from those addressed to people 
rated 6 or 7 but, unfortunately, no data were 
collected which are relevant to this point. 

We may now raise the question as to whether 
or not the communication pattern, and hence 
the available cues, were different between the 
compatible and incompatible conditions. We 
may note in Table 4 that for both countries 
the frequency of communication addressed to 
those rated 1 or 2 is higher for the compatible 
condition than for the incompatible condition. 
The figures are about equal between the two 
conditions for those rated 3 (indifferent). In 
the incompatible condition Ss communicated 
more to those rated 5, 6, or 7 than in the 
compatible condition. In other words, in the 
incompatible condition Ss communicated 
relatively more to those they like and rela- 
tively less to those they dislike. While the 
consistency of the differences between the two 
countries lends weight to them, the differences 
within each country between the compatibles 
and incompatibles are not highly significant. 
For the Minneapolis data the difference 
between the conditions is significant (p < .01) 
for communications addressed to those rated 
4 or 5. For the Leiden data the difference 
between conditions in frequency of communica- 
tion to those rated 1 or 2 is significant 
(p < .05). None of the other differences 
approaches an adequate level of significance. 
Mainly because of the startling consistency 
between the two countries, we may conclude 
that there is some difference in the nature of 
the cues made available by those in different 
conditions. 

We can make only tentative suggestions as 
to why this difference exists between the 
conditions. It is possible that Ss in the com- 
patible condition, feeling more secure in their 
personal relationships, are more able to 
directly express their likes and dislikes toward 
others. Thus they are able to react to those 
they dislike. The Ss in the incompatible 
condition, on the other hand, may feel less 
able to express their dislikes directly and so, 
whenever possible, prefer to withdraw from 





contact with those they dislike and maintain 
more contact with those they like. If this 
interpretation of the difference between 
conditions is correct, we would expect to find 
that Ss in the compatible condition show their 
dislike when talking to those they rate 1 or 2. 
Unfortunately we do not have these data, and 
so the interpretation must remain highly 
tentative. 

In comparing the data from Minneapolis 
with the data from Leiden one is struck by the 
consistently higher numbers of individual 
communications in the Leiden data. This is a 
function of the proportion of the communica- 
tions that Os designated as directed toward the 
group rather than toward specific individuals. 
In Minneapolis, 58 per cent of the communica- 
tions were recorded as directed toward specific 
individuals while in Leiden the comparable 
figure was 75 per cent. This was probably a 
difference in observer practice rather than a 
real difference in behavior of the groups. If we 
correct the Minneapolis or the Leiden data for 
the difference in proportion of communications 
judged to be addressed to specific individuals, 
the difference in absolute frequency disap- 
pears. 

Perception of the patiern of communication. 
In the preceding section we have demonstrated 
that cues were available from the frequency of 
communication on the basis of which it would 
be possible for an individual S to judge. how 
others in the group felt toward him. Before 
these cues could be used, of course, it would be 
necessary to perceive them. We will now 
examine data relevant to the question of 
whether or not these cues were perceived. 

On the questionnaire after the experiment 
Ss were asked to indicate on a scale from 
0 (not at all) to 4 (very much) how much they 
had talked to each of the others in the group 
and also how much each of the others had 
talked to them. We may then examine these 
data to determine if Ss were aware of the 
pattern of communication that existed in the 
group. 

Table 5 presents the data on the relationship 
between Os’ records of the discussion and the 
ratings on the questionnaire. 

The data in Table 5 show clearly that Ss 
were able to report the pattern of communica- 
tion fairly accurately, both as to whom they 
talked to and who talked to them. Again the 
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TABLE 5 
ACCURACY OF PERCEPTION OF THE COMMUNICATION 


Rating on Question- 
naire of “How Much 
I Talked to Them” 


Observed average frequency of 
communication given by 
subject in: 


Compatible Condition (M) §1.42.35.0 7.4 
Incompatible Condition (M) 61.83.05.4) 8.3 
Compatible Condition (L) 41.73.87.011.1 
Incompatible Condition (L) 62.04.1,7.613.0 


Rating on Question- 

naire of “How Much 

They Talked to Me” 
Ori i2isi ¢ 


Average observed frequency of 
communication received by: 


Compatible Condition (M) §1.42.34.5 7.0 
Incompatible Condition (M) §1.2/3.05.6 7.7 
Compatible Condition (L) §1.74.57.112.1 

1.22.44.27.512.3 


Incompatible Condition (L) 


data are similar between the two conditions 
and between the two countries. It is important 
to note that, considering the data in a some- 
what different light, Table 5 not only shows 
that Ss perceived the pattern of communica- 
tion, but also it corroborates the validity of 
the observation of direction of communication. 

We may now ask whether or not the amount 
of communication directed toward specific 
persons, which potentially could give cues for 
judging how the other person felt, seems to 
have been used in this manner by Ss. Table 6 
presents the relation between the average 
number of communications received and Ss’ 
estimates of how well the communicator liked 
them. 

Inspection of Table 6 shows that Ss in both 
conditions and in both countries seem to have 
used the available cues. In all instances, the 
same nonmonotonic relationship that we 
observed in Table 4 holds. The Ss judged those 
who talked to them least as being indifferent 
toward them. Those whom Ss judged to either 
like or dislike them direct communications to 
them more frequently than those judged to be 
indifferent. In two out of the four comparisons 
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TABLE 6 
RELATION BETWEEN AVERAGE FREQUENCY OF Com- 
MUNICATION RECEIVED AND FINAL ESTIMATES OF 
How We tt Orners LIKE THF 


Final Estimates of Preference 
Condition 


lor2) 3 4 5 6or7 
Compatible (M) 3.9 | 2.1 | 2.1 | 2.7 | 4.0 
Incompatible (M) | 2.5) 1.9 3.0) 4.1 | 4.1 
Compatible (L) 5.4/2.9 | 3.2 | 4.1 | 5.6 
Incompatible (L) 5.5 | 2.8 | d.2 1 3.2168 


(two conditions in each of two countries) the 
difference in amount of communication 
received from those judged to be indifferent 
(rating of 3) and those judged to dislike them 
(ratings of 1 or 2) is significant (p < .05). 
The difference between communication re- 
ceived from those judged to rate them 3 and 
those judged to rate them 6 or 7 is significant 
at the .01 level in every instance. We may then 
conclude that Ss in both conditions, in both 
countries, seem to have used the cues, and we 
may then expect to find that they know, 
significantly better than chance, how others in 
the group feel toward them after the discussion. 

Accuracy of knowledge of preference. We may 
now turn our attention to the question of 
whether or not the accuracy of estimation of 
how others felt about them was better than 
chance. 

For each estimate that Ss made concerning 
how others felt about them, we may calculate 
the average actual rating of preference 
received. If there are no differences among 
these average ratings for different estimates, 
then we would conclude that Ss had no 

TABLE 7 
AccuRACY OF ESTIMATION OF 
DISCUSSION 


PREFERENCE BEFORE 


Estimate of Others’ Preference 


“ee for Themselves 
Condition 


lor2; 3 4 5 6or7 
Compatible (M) 5.1 | 4.7 | 4.7 | 4.6 | 4.9 
Incompatible (M) | 4.7 | 4.5 4.7 4.8 5.0 
Compatible (L) 4.3 | 4.3 | 4.5 | 4.8 | 4.9 
Incompatible (L) 4.2! 4.2) 4.5 | 4.7 | 5.0 


Note.—Each figure in the table is the average actual rating 
given to those who estimated that that person would rate them a 
given way 
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knowledge of how others really felt about 
them. Table 7 presents the data on accuracy 
of estimation of preference at the beginning of 
the experiment, that is, before Ss knew each 
other or had had much opportunity to inter- 
act. 

Examination of Table 7 shows that there 
was very little knowledge beforehand of how 
others in the group felt about them. This is of 
course to be expected. The groups were 
composed ef people who were strangers to one 
another and, at the time these ratings and 
estimates were made, knew nothing about 
each other except their name, what part of 
the country they came from, and what they 
were studying. The Minneapolis data show no 
relation between estimates made and actual 
preference. The Leiden data do show a slight 
relationship. The average ratings tend to go 
up as the estimates increase. The differences 
for both conditions between the average 
ratings for estimates of 3 and estimates of 5 
are significant beyond the .05 level. It is 
possible that in Holland, name, part of 
country, and field of study are more important 
items of information than they are in the 
United States. Such a difference, which it is 
plausible to assume, would account for the 
slight evidence of knowledge in the Leiden 
data before the discussion. 

There are, of course, no differences between 
the compatible and incompatible conditions in 
either country. It would indeed be strange if 
there were differences since the actual manipu- 
lation which created the distinction between 
the conditions was performed after the initial 
ratings and initial estimates had already been 
made. 

The same kind of analysis of the ratings and 
estimates made after the discussion will show 


TABLE 8 
ACCURACY OF ESTIMATION OF PREFERENCE AFTER THE 
DISCUSSION 


Estimate of Others’ Preference 


. dias for Themselves 
Condition 


lor2) 3 4 5 6o0r7 
Compatible (M) 5.1} 4.9| 5.0] 4.9 | 5.3 
Incompatible (M) | 4.5 | 4.9 5.1 | 5.0) 5.7 
Compatible (L) 4.6 | 4.3 | 4.5 | 4.9 | 5.3 
4.1/4.5 | 4.6 | 4.9 | 5.4 


Incompatible (L) 





us whether or not Ss learned something during 
the discussion about how others felt toward 
them. These data are presented in Table 8. 

In Table 8 there is clear evidence that Ss in 
the incompatible condition had significant 
knowledge, after the discussion, of how others 
in the group felt about them. This is true for 
both countries. The average rating received 
from those they think dislike them is signifi- 
cantly lower than the average ratings received 
from those they think will rate them 3, 4, or 5. 
These differences are significant at the .05 
level or better in each country. Those that the 
incompatible Ss think rate them 6 or 7 (like 
very much) actually do give them higher 
average ratings. In each country, these 
average ratings are significantly higher than 
any of the others (p < .05). 

The evidence is not nearly as clear that Ss 
in the compatible condition have knowledge 
of how others feel about them. In neither 
country is there any difference among the 
average ratings received from those who Ss 
thought rated them anywhere from 1 (dis- 
likes) through 5 (likes me moderately). Those 
that the compatibles think rated them 6 or 7, 
however, do give them somewhat higher 
ratings. In the Leiden data the differences 
between this average rating received from 
those they think will rate them 6 or 7 and any 
of the other average ratings are significant 
(p < .02). In the Minneapolis data the dif- 
ferences are smaller and only the difference 
between the average ratings from those who 
Ss think will rate them 6 or 7 and those they 
think will rate them 4 or 5S is significant at the 
.05 level. But still, the data for both countries 
are extremely similar. The compatibles seem 
to have correct knowledge of how others feel 
about them only for those who they think 
like them very much. The incompatibles, on 
the other hand, also have correct knowledge 
about those they think dislike them. 

The differences between the two conditions, 
while in the same direction for both countries, 
are larger for the Minneapolis data than for 
the Leiden data. In the Minneapolis data the 
difference between the compatible and in- 
compatible conditions in the average rating 
that Ss receive from those they think dislike 
them is significant at the .10 level of confidence; 
between those they think like them very much 
it is significant at the .06 level of confidence. 
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The conjunction of both these differences is 
certainly highly significant. In the Leiden 
data the difference between the compatible 
and incompatible conditions in the average 
rating that they receive from those they think 
dislike them is significant at only the .20 
level of confidence. No other differences 
between conditions approach significance. The 
fact that all the differences are in the same 
direction in both countries, however, lends 
strength to the assertion that the difference is 
a real one. 

In comparing the data between the two 
countries one may note that the ratings made 
by the Minneapolis Ss are consistently higher 
than the ratings made by the Leiden Ss. 
Nothing can be made of this, however, since 
one cannot be certain that the translation of 
the rating scales from English into Dutch 
preserved intact the distances between points 
on the scale or the exact location of each point 
on the scale. 

It will be recalled that in the introduction 
the hypothesis was advanced that Ss in the 
incompatible condition, because of their 
instability and uncertainty about their 
relations with others in the group, would be 
more sensitive to the cues that others were 
giving and would know more accurately how 
others in the group felt about them. This part 
of the hypothesis has, in general, been con- 
firmed by the data. It was also stated in the 
introduction that one would expect Ss in the 
compatible condition, because of their greater 
certainty in the situation, to be more able to 
directly express their feelings toward others. 
If this hypothesis were true we would expect 
to find that Ss knew the feelings of those in the 
compatible condition more accurately than 
they knew the feelings of those in the incom- 
patible condition. This is net borne out by 
the data. There is no difference in accuracy of 
knowledge about the preferences of Ss in the 
two conditions. 

On the other hand, in Table 4 we observed 
that there was some difference between the 
two conditions in the nature of the cues 
concerning their preferences for others. Those 
in the compatible condition spoke more to 
those they disliked and less to those they 
liked than did Ss in the incompatible condition. 
These data are still consistent with the 
notion that Ss in the compatible condition 
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were more able to directly express their 
dislikes. However, since no difference was 
found in accuracy of knowledge about Ss in 
the two conditions, it seems necessary to as- 
sume that other, equally good cues were made 
available by Ss in the incompatible condition. 
At any rate, the evidence from the experiment 
gives no direct support to the second part of 
our initial hypothesis. 


SUMMARY AND CONCLUSIONS 


The major findings from the experiment 
which has been described may be summarized 
as follows: 

1. If persons in a group feel that those 
members of the group whom they like best 
dislike each other, this tends to make them 
uncertain and unstable about their inter- 
personal relations in the group. 

2. Both those who are stable and those who 
are unstable about their personal relations 
reveal something about how much they like 
others in how often they talk to them. They 
tend to talk least to those toward whom they 
feel indifferent. 
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3. There is some difference between the two 
conditions in how sensitive individuals are to 
the cues which others give off. Those who feel 
unstable about their personal relationships 
tend to have more accurate knowledge of how 
others feel about them. It is perhaps plausible 
to infer that those who are unstable about their 
personal relations watch more closely and are 
able to make better use of the available cues. 

4. The results in both countries, Holland 
and the United States, are remarkably similar. 
The consistency of the results tends to 
strengthen confidence in their significance and 
also tends to show that these particular 
findings are independent of the cultural 
differences between these two countries. 
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PROJECTION AND DISPLACEMENT: A CROSS-CULTURAL STUDY 
OF FOLK-TALE AGGRESSION! 
GEORGE O. WRIGHT 
Graduate School of Education, Harvard University* 


HE stuDY of societies reveals many 

examples of the use of folk tales and 

rituals which are aggressive in content. 
In varying degrees this aesthetic aspect of cul- 
ture seems to provide a device by which ag- 
gressive feelings can be displayed in fantasy 
without the disruptive effects which otherwise 
would result from their direct expression. This 
fantasy aggression, however, presumably has 
some correspondence with real-life aggression. 
In order to understand folk-tale aggression, 
therefore, we will attempt to discover the re- 
lationship between specific antecedent condi- 
tions and the specific outcomes with which 
they are associated in folk tales. 

Part of the theoretical framework for the 
study of folk-tale aggression grows out of the 
approach-avoidance theory of Miller (1). This 
theory gives us a scheme for appraising the 
joint effects of approach and of avoidance in 
conflict situations. While the experimental 
evidence supporting the approach-avoidance 
theory originally came from work with ani- 
mals, a reasonable elaboration will transform 
its application from psychophysical phenomena 
to those which are psychocultural. 

One of the important elaborations of the 
approach-avoidance theory is the result of the 
work of Whiting (3, 4). Whiting’s hypothesis 
asserts that the intensity of the displaced re- 
sponses, as well as the cultural distance of 
displacement, is a function of anxiety. This 
hypothesis is confirmed in the doll-play inves- 
tigations in which it was found that the dis- 
play of large amounts of aggressive behavior 
in fantasy is connected with a high degree of 
anxiety arising from previous punishment for 
aggression (2). 

Another elaboration of the approach-avoid- 
ance theory is the result of the work of Whit- 


1 This material is part of a doctoral dissertation sub- 
mitted by the author to the Graduate School of Educa- 
tion, Harvard University, 1952. The author wishes to 
express his appreciation to Dr. John W. M. Whiting 
for his guidance in carrying out the research on which 
this paper is based. 
hej? Now at the Wright Air Development Center, 
U. S. Air Force. 
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ing and Child (4), in which the original theory 
as it applied to displacement is extended to 
projection. This work, which examined the 
relationship between aggression anxiety and 
the theories held by members of the society 
regarding the causes of disease, attested the 
validity of the theory and provided further 
evidence that such a theory, in fact, could be 
useful in understanding cultural phenomena. 
The work of Whiting and Child did not 
independently test displacement and projec- 
tion. This present study, however, does make 
this independent test by analyzing folk-tale 
behavior. The choice of folk tales for study was 
somewhat dictated by the fact that such an 
analysis could be carried out on the basis of 
classifying the behavior of the characters in 
the folk tales as either displacement (qualities 
of the objects of aggression) or projection 
(qualities of the agents of aggression). 


PROCEDURE 


The material used in this study has been developed 
from an analysis of folk tales from 33 societies, includ- 
ing selections from Oceania, Asia, Africa, North 
America, and South America. Since the child-training 
data of Whiting and Child (4) were to be used as one 
of the variables, folk tales had to be assembled for the 
societies that were among the 75 used in their study. 
The folk-tale literature of these 75 societies was ex- 
amined, and preliminary selections on the basis of 
available material from 37 societies were made. At the 
outset it was decided not to include origin legends be- 
cause of their lack of comparability with ordinary folk 
tales. For the same reason, those folk tales told ex- 
clusively in the third person with the ethnographer as 
narrator were excluded, since they were likely to be 
overlaid with the cultural bias of the ethnographer. In 
consequence of one or the other of these restrictions, 
the folk tales from four of the initial selections were 
discarded. These societies were the Ainu, Arapesh, 
Azande, and the Maori. 

In order to provide sufficient data for analysis, twelve 
folk tales were selected from each society. Mcreover, 
in order to eliminate the inclusion of fragmentary tales, 
each of the folk tales selected was required to be a 
complete episode. An episode was defined as a sequence 
of action having a definite beginning and proceeding 
to some definite action climax. 

In order to carry out the study, a scheme permitting 
quantitative analysis of the folk tales was needed. In 
addition, it was necessary that the scheme comprise ob- 
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jective scoring categories so that the subjective esti 
mates of the investigator would be minimized. These 
requirements are met in the scoring scheme used by 
the doll-play investigators. Their scheme was modified 
for use with folk tales. 

Central to the scoring system is the concept of 
interaction units. As the term implies, an interaction 
unit is a complete behavior sequence between two 
persons. In order to be complete, the sequence must 
contain at least two action elements. First, a person 
must initiate an action toward some other person. 
Second, this initiated action must be responded to by the 
other person. For example, a mother initiates action by 
calling her child to lunch. The child responds to this 
action by stopping his play and coming into the house 
The first element we can call imitiation; the second, 
response. The person starting the action we can call 
the initiator; the person responding to the action we 
can call respondent. 

Now, since many of the actions are likely to be more 
than simple action-reaction, we have decided on a con- 
vention for complex actions. In our example of the 
mother calling the child to lunch, how will the action 
be scored if the child were to say, “I’m busy, mother, 
and can’t come in now”? What is the action of the 
mother to be called if she in turn responds to the reply 
of the child by a threat of spanking? The convention 
adopted is to score the calling of the mother and the 
reply of the child as one interaction unit, with mother 
as initiator and child as respondent. In turn, the reply 
of the child and the threat of the mother are scored as 
the second interaction unit, with the child as initiator 
and the mother as respondent. If the threat of punish- 
ment causes the child to come into the house, this is 
scored as the third interaction unit, with mother as 
initiator and child as respondent. Allowing a reaction 
to be both a response and an initiation solves our 
scoring dilemma. 

When the interaction unit is used as the vital 
measure in the scoring system, it is possible to build 
additional classifying measures around this unit. For 
example, it is possible to classify the status of the 
initiator, the status of the respondent, the nature of 
the initiation, and the nature of the response. It is pos- 
sible, moreover, to make comparisons from folk tale to 
folk tale by the use of the quantitative measures. 

The interaction sequences were scored as aggressive 
or nonaggressive; each aggressive act was scored for 
intensity (ranging from a score of 1 for nonaccomplish- 
ment of aggressive intent to a score of 5 for killing); 
and finally, each sequence was classified as to the agent 
and object of the act. 

In order to test the reliability of the scoring system, 
four students at the Wilberforce State College (Ohio) 
were asked to score 36 folk-tale episodes selected at 
random from the collection. These students were trained 
in the use of the system for five hours prior to the 
scoring. An index of reliability was computed as fol- 
lows: the sum of the agreements for a pair of the 
student scorers was divided by the total number of 
items scored by them; the scores of the writer and of 
the students were randomly paired in all possible 
combinations for each of the scoring categories; the 
resultant index of reliability was 0.83, with 1.0 repre- 
senting perfect agreement. 

The scoring of the folk tales provides a compre- 
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hensive set of data on the display of aggressive feelings 
in folk tales. This is our consequent variable. Data also 
were needed for the child-training practices, our ante- 
cedent variable. For this antecedent variable, the 
child-training data of the study of Whiting and Child 
(4) were used. These data, in the form of ratings on 
seven-point scales, were available for 26 features of 
child-training practices. The details of the rating scale 
for the practice that is relevant to the present study 
are as follows (the numbers indicate the ends of the 
rating scale) : 
Severity of Aggression Training 
7--Child severely punished for any form 
of aggression; fighting, disrespect, or dis- 
obedience not permitted. 
1—All but the most severe forms of ag- 
gression permitted; fighting encouraged. 

In analyzing the data, the societies were divided 
into two groups, high and low, on the basis of the rat- 
ings for the child-training practice. The data were then 
analyzed for significance of difference between means. 


THE EFrFrect OF PUNISHMENT ANXIETY ON 
DISPLACEMENT 


The first hypothesis to be tested is that ob- 
jects of displaced aggression are chosen farther 
and farther out on the generalization con- 
tinuum as a function of the amount of aggres- 
sion anxiety. 

This hypothesis is derived from the ap- 
proach-avoidance theory, which considers the 
joint habit strength of aggression and ag- 
gression anxiety to be the algebraic difference 
between the two tendencies. As the degree of 
anxiety is increased, this anxiety function 
shifts the point of maximum response toward 
objects less and less like the original object. 

The approach-avoidance theory specifies 
where on the generalization continuum the 
maximum response is likely to occur, but 
Whiting’s theory specifies also how strong this 
maximum response is likely to be. This theory 
asserts that anxiety is frustrating, that is to 
say, it interferes with the initial response, and 
that this interference is itself drive-producing. 
Consequently, the joint effective habit poten- 
tial is a function of the product of the joint 
habit strength (i.e., of the difference between 
the aggression and the aggression anxiety) and 
the strength of frustration as indicated by 
aggression anxiety. 

In order to translate this hypothesis into 
terms that can be tested with the folk-tale 
data, the punishment of children for aggression 
will be called aggression anxiety; the Hero will 
be called the point of original response; the 
similarity dimension will be considered as be- 
ginning at the Hero and extending through 








PROJECTION AND DISPLACEMENT IN FOLKLORE 


friends, relatives, and acquaintances to stran- 
gers at the least similar position; and finally, 
the frequency of aggressive acts involving a 
given category along the dimension of similar- 
ity will be called strength of response. Further, 
we will consider our hypothesis supported if 
the following conditions are found: 

1. In the folk tales of a society in which the 
child-training practice is low severity of pun- 
ishment for aggression, the frequency with 
which friends of the Hero are used as objects 
of aggression should be higher than it is in the 
folk tales from societies in which the practice 
is high severity of punishment for aggression. 
This condition should be found because our 
theory predicts that the lower the anxiety, the 
more like the original object will be the dis- 
placed objects. 

2. Conversely, in the folk tales of societies 
in which the child-training practice is high 
severity of punishment for aggression, the 
frequency of the use of strangers as objects of 
aggression should be higher than in those of 
societies in which the practice is low severity 
of punishment for aggression. This condition 
should be found because our theory predicts 
that the higher the anxiety, the less like the 
original object will be the displaced objects. 

The data used for this analysis were the 
antecedent child-training measures of severity 
of punishment for aggression versus the fre- 
quency of use of the friends of the Hero as 
objects of aggression. The scores of frequency 
of use of friends as objects of aggression were 
obtained by counting the number of times the 
friend is used as the object of aggression in 
the folk tales, and dividing this frequency by 
the total aggressive acts. 

Analysis of the data indicates that for the 
group of societies in which aggression anxiety 
is low (low severity of punishment for ag- 
gression), the mean frequency of the use of 
friends of the Hero as the object of aggression 
is 13.3, whereas the mean of the group of 
societies in which aggression anxiety is high 
(high severity of punishment for aggression) is 
8.67. The difference is significant at the .005 
level (obtained ¢ = 3.8).* The results of this 
and the following tests are shown in Table 1. 

The use of the measures for the friends of 
the Hero as the object of aggression instead of 


3 In all of the results reported, only one tail of the 
t distribution is used, since all of the results are in the 
predicted direction. 


TABLE 1 
SUMMARY OF FINDINGS 


Aqueaien 
Mean Frequencies of _— t p 
Aggression : . 
High Low 
N=12N=14 


Agent of aggression 
Hero as agent 
Friend as agent 
Stranger as agent 

Object of aggression 
Hero as object 


22.3 23.0 | 0.21 | .25 
15.33 18.54 1.18  .13 
30.5 25.43 2.46 .01 


27.33 29.43, 0.47 | .30 


Friend as object 8.6713.3 | 3.8 .005 
Stranger as object 31.66'26.58) 2.21 | .015 
Intensity of aggression 19.58 15.7 | 2.02 | .035 
Hero triumphant 2.14 4.85) 2.86 | .005 


Note.—This table shows mean scores for frequencies of various 
categories of agents and objects of aggression, for intensity of 
aggression, and for fate of the hero in the folk tales. (Societies are 
dichotomized on the basis of two degrees of aggression anxiety, 
high and low.) 


the Hero himself is a question that naturally 
arises. In applying the test of the hypothesis, 
it was felt that the Hero as the object of ag- 
gression actually represented self-aggression, 
and that this fact might introduce this effect 
as an uncontrolled variable. Consequently, the 
objects considered closest to the Hero, namely, 
his friends, were chosen. 

In order, however, to provide information 
that might be useful in refining the hypothesis, 
the analysis of the data on the Hero was carried 
out. In this analysis, the means of the frequency 
of use of the Hero as object of aggression 
were compared for the high aggression-anxiety 
group of societies and the low aggression- 
anxiety group of societies. The difference is 
not significant—that is to say, the difference 
in the means which we observed are likely to 
have arisen by chance. The obtained ¢ is 0.47; 
the significance level is approximately .30. 

The test of the converse condition, that of 
comparing the frequency of the use of stran- 
gers as the object of aggression under the 
conditions of high aggression anxiety and of 
low aggression anxiety, discloses that the 
mean of the frequency of use of strangers as 
the object of aggression for societies in which 
aggression anxiety is high is higher than in 
societies in which aggression anxiety is low. 
The means are respectively 31.66 and 26.58. 
The difference between the means is significant 
at approximately the .02 level (obtained 
t = 2.21). Thus, in sum, the societies in which 
children are severely punished for aggression, 
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as compared to the mild societies, represent 
friends of the Hero Jess frequently and stran- 
gers more frequently, as would be expected 
from the displacement hypothesis. 


Tue Errect OF PUNISHMENT ANXIETY ON 
PROJECTION 


The second hypothesis to be tested is that 
the agents of projected aggression are chosen 
farther and farther out on the generalization 
continuum as a function of the amount of 
aggression anxiety. 

In order for our theory to be consistent, the 
findings for displacement should have their 
counterpart in projection. The child-training 
measures of punishment aggression, used as 
indices of aggression anxiety, should be related 
to projection in such a way that the folk tales 
of societies in which punishment for aggression 
is severe should display more aggression per- 
formance that uses the stranger as agent, since 
the tendency to express aggression under these 
conditions should be projected far out on the 
generalization continuum. Stated another way, 
high aggression anxiety should be associated 
with a more frequent use of stranger as ag- 
gressive agent. 

The folk-tale data translated into terms of 
our theory are analyzed, using severity of 
punishment for aggression as the degree of 
anxiety about aggression, and the frequency 
of the use of various categories as agents of 
aggression; these were considered an index of 
the strength of the joint effective habit of ag- 
gression and aggression anxiety. We will con- 
sider our hypothesis as supported if the 
following conditions are found: 

1. In the folk tales of societies in which the 
child-training practice is low severity of pun- 
ishment for aggression, the frequency of use of 
the Hero as agent of aggression should be 
higher than in those of societies in which the 
practice is high severity of punishment for 
aggression. This condition should be found be- 
cause our theory states that the lower the 
anxiety, the more like the original object will 
be the projected agents. This condition should 
also be found for the use of friends of the Hero 
as agents of aggression. 

2. In the folk tales of societies in which the 
child-training practice is high severity of 
punishment for aggression, the frequency of 
use of strangers as agents of aggression should 
be higher than in folk tales from societies that 
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practice low severity of punishment for ag- 
gression. This condition should be found be- 
cause our theory states that the higher the 
anxiety, the less like the original objects will 
be the projected agents. 

The data used for this analysis were the 
antecedent child-training measures of severity 
of punishment for aggression versus the fre- 
quency of the use of the Hero as agent of 
aggression. The scores of frequency of use of 
the Hero as agent of aggression were obtained 
by counting the times the Hero was used as 
agent of aggression in the folk tales, and 
dividing this by the total aggressive acts. 

Analysis of the data indicates that the mean 
frequency of use of Hero as agent of aggression 
in the high aggression-anxiety societies is 
22.3, whereas it is 23.0 in the low aggression- 
anxiety societies. The difference between these 
means, though in the predicted direction, is 
not significant (p is approximately .25; ob- 
tained ¢ = 0.21). 

Analysis of the data for the use of friends 
of the Hero as agent of aggression, using a 
procedure similar to that for the Hero, indi- 
cates that the difference between the means of 
frequency of use of friends as agents of ag- 
gression is also in the predicted direction, but 
is not significant (p is approximately .13; 
obtained ¢ = 1.18). 

In the converse condition, namely, high 
aggression anxiety accompanied by the more 
frequent use of the stranger as agent of ag- 
gression, analysis of the data indicates that 
for the group of societies in which aggression 
anxiety is high, the mean frequency of use of 
stranger as agent of aggression is higher than 
that of the group of societies in which the ag- 
gression anxiety is low. The mean frequencies 
are, respectively, 30.5 and 25.43. The difference 
between the means is in the predicted direction 
and is significant at the .01 level (obtained 
t = 2.46). 

The findings discussed here tend to support 
the theorems and to suggest that the individual 
may, in fact, use psychologically harmless 
ways to express his aggression in the presence 
of aggression anxiety. It is suggested, more- 
over, that the individual apparently chooses 
harmless agents and objects of aggression on 
the basis of their categorical differences from 
the agents and objects associated with the 
initial anxiety. The findings also suggest that 
two qualifications are essential in the inter- 
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pretation of these data. Both relate to the dis- 
crete placement of the cultural categories 
along the dimension of similarity. 

The first qualification is that the present 
theory, which relates anxiety to projection 
and displacement, is not refined enough to 
predict the characteristics and the behavior of 
the Hero in folk tales as a function of the anx- 
iety of the society. The explanation proposed 
is that the position which the Hero occupies 
on the generalization continuum is theoreti- 
cally influenced by the transition between real 
life and fantasy and that more things must be 
known about the influence of complex psy- 
chological factors in this transition area before 
our predictions will gain accuracy. 

The other qualification is that a precise 
placement of the intermediate categories of 
relatives and acquaintances cannot be made 
with the use of our presently developed theory. 
The explanation of this condition probably 
lies in the complexity of meaning which various 
societies attach to cultural categories. It is 
possible, for example, that in one society the 
similarity of relatives to strangers might be 
perceived to be greater than between ac- 


quaintances and strangers, while in another 
society the opposite might be true. 


THe Errect oF AGGRESSION ANXIETY ON 
INTENSITY OF AGGRESSION 


Another aspect of the relationship of ag- 
gression anxiety to fantasy aggression is of 
interest, namely, the apparent influence of the 
strength of anxiety on the intensity of aggres- 
sion in folk tales. Our theory suggests that the 
aggression anxiety will diminish at a more 
rapid rate than the intensity of the aggression 
as displacements and projections are made 
farther and farther vut on the generalization 
continuum. We should be able to foretell from 
this theory that fantasy, since it represents 
situations in many ways dissimilar from real 
life, should provide a vehicle for the expression 
of aggression, and should do so with an inten- 
sity which is directly related to the amount of 
real-life punishment for aggression. 

An examination of this theory is made in 
the third hypothesis to be tested, namely, that 
punishment for aggression increases the in- 
tensity of aggression in situations which are 
distinctly dissimilar from those in which the 
punishment occurs. 

This hypothesis, translated into terms which 
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are applicable to the folk-tale data, states that 
the punishment of aggression leads to aggres- 
sion anxiety. This anxiety reduces the display 
of aggressive behavior toward objects similar 
to those associated with the initial punish- 
ment. As we have seen, in such situations 
displacement and projection of aggression are 
likely to occur. Moreover, the strength of the 
joint effective habit of the aggression and of 
the aggression avoidance is thought to be the 
product of the difference between the two 
tendencies and the degree of avoidance. We 
should expect, therefore, that the greater the 
anxiety, the greater the intensity of the fan- 
tasy aggression. 

Our hypothesis will be supported if it is 
found that the folk tales of societies in which 
high aggression anxiety is generated by severe 
punishment for aggression show a greater in- 
tensity of aggression than exists in the folk 
tales of societies in which low aggression anx- 
iety is generated. 

The data used for this analysis were the 
same child-training measures of severity of 
punishment for aggression used in the previous 
analysis related to the intensity of the folk-tale 
aggression. The scores for intensity were ob- 
tained by taking the sum of the intensity 
measures for each act of aggression in the folk 
taies of a given society and dividing this sum 
by the total acts of aggression in the folk tales. 

An analysis of the data indicates that the 
mean of the intensity of aggression for the 
group of societies practicing high severity of 
punishment for aggression is 19.58, whereas 
the mean for the group practicing low severity 
of punishment for aggression is 15.7. The 
difference between the means is in the pre- 
dicted direction and is significant at approxi- 
mately the .03 level (obtained ¢ = 2.02). 


Tue EFFECT OF AGGRESSION ANXIETY ON THE 
FATE OF THE HERO 


There is one additional analysis by which 
information may be disclosed about the fan- 
tasy structuring of the aggression in folk tales, 
namely, the relationship between aggression 
anxiety and the fate of the Hero in the folk 
tales. Actually, such an analysis provides an- 
other test of the kind and amount of projection 
and displacement which develop as the con- 
sequent variable of aggression anxiety. The 
fate of the Hero, moreover, is likely to be a 
more sensitive measure of the expression of 
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fantasy aggression than those involving sec- 
ondary characters. 

An examination of this theoretical position 
is done in the fourth hypothesis to be tested, 
namely, that the Hero should triumph less 
often in folk tales of societies in which the 
child-training practices include severe punish- 
ment for aggression. This hypothesis is a re- 
statement of the second hypothesis that high 
aggression anxiety results in projection far out 
from the Hero on the generalization con- 
tinuum. This theorem is built on the assump- 
tion that the behavior of the Hero is under- 
standable as a projection of the individual. 
Consequently, in cases of high aggression anx- 
iety, it is to be expected that having the Hero 
triumphant is likely to be accompanied by 
large amounts of anxiety, so that triumphant 
characters of the folk tales are likely to be less 
like the Hero. In short, a small degree of tri- 
umphant-Hero aggression should accompany 
high aggression anxiety. 

The data used for this analysis were the 
child-training data of severity of punishment 
for aggression versus the frequency of Hero 
triumphant in the outcomes of the episodes. 

An analysis of the data for the fate of the 
Hero shows that the Hero is triumphant less 
frequently in the high aggression anxiety group 
of societies. The means are 2.14 and 4.85. 
The difference between the means is signifi- 
cant at the .005 level (obtained ¢ = 2.86). 

This finding is in accord with the earlier re- 
sults in which it was seen that a comparable 
relationship was true, namely, that of the more 
frequent use of the stranger as agent of ag- 
gression in folk tales of societies in which ag- 
gression anxiety is high. In both cases, the 
theoretical assumptions underlying the func- 
tioning of the phenomenon of generalization in 
conflict are found to be valid. 


SUMMARY 


1. The analysis of the folk-tale data supports 
the essential features of the theory which was 
developed to supply insight into the conse- 
quences and outcomes of action and behavior 
in folk tales. The approach-avoidance theory 
of Miller, restated in terms of Whiting’s modi- 
fication of it for the influence of conflict-pro- 
duced drives, provides a coherent basis for the 
explanation of the phenomena of projection and 
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displacement in folk tales. Admittedly, the pre- 
dictive powers of such a theory are not perfect, 
but the results obtained for the tests of the 
hypotheses are all in the predicted direction. 

2. In general, it is found that high punish- 
ment anxiety, when it is connected with the 
expression of aggression, leads to the kind of 
folk-tale behavior in which the objects and 
agents of aggression are far out on the generali- 
zation continuum and are least like the Hero. 
In addition, the Hero in these folk tales is not 
likely to be triumphant in the outcomes. 

3. In general, it is found that low punish- 
ment anxiety, when it is connected with the 
expression of aggression, leads to the kind of 
folk-tale behavior in which the objects and 
agents of aggression are chosen close to the 
Hero on the generalization continuum. In 
addition, the Hero is likely to be triumphant 
in the outcomes. 

4. High punishment anxiety, when it is con- 
nected with the expression of aggression, leads 
to the display of a more intense kind of ag- 
gressive behavior in folk tales than is true for 
low punishment anxiety. 

5. Finally, the study demonstrates that 
folk tales as fantasy material are fruitful for 
analysis. Enough consistency of trends has 
been found in the present study to suggest 
that the relationships of child-training prac- 
tices and the expression of aggression in folk 
tales are real and not accidental or spurious. 
This consistency and its value are somewhat 
enhanced by the fact that the variables of 
child-training practices used in this analysis 
are data from an independent investigation 
which were collected and analyzed by workers 
who were not directly involved in the scoring 
and analysis of the present folk-tale study. 
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A MOVEMENT INTERPRETATION OF THREAT?! 
ALVIN W. LANDFIELD 


Purdue University 


HE PRESENT investigation of threat was 

carried out within a theoretical frame- 

work recently advanced by Kelly (2). 
The independent and dependent variables, the 
relationship between them, and the method- 
ology employed in testing the experimental 
hypotheses are derivable from or are consistent 
with the following theoretical statements of the 
psychology of personal constructs. 


A person’s psychological processes evolve toward what 
he construes to be an optimal anticipation of events. 
Definitions: Processes— ...the subject of psychology 
is assumed at the outset to be a process. Evolve—... 
the patterns of psychological processes undergo pro- 
gressive change in themselves as they function. Toward 
—Directionality of processes is built into the funda- 
mental postulate. What he construes—The direction- 
ality of processes is assumed to be an essential function 
of prior outcomes of those same processes. The person 
himself sets the objectives and he does it, just as the 
psychologist must, by construing, by attributing sig- 
nificance, by framing events with meaning at some 
minimal level. Anticipation—The direction of mental 
processes is established as prediction. This is an extra- 
polation to the domain of the individual of the com- 
monly accepted notion regarding the aim of science 
being prediction and control. Optimal—. . . some sort 
of reconciliation of potentially contradictory anticipa- 
tions must be assumed as taking place in the individual 

. (2, pp. 29-31). 


Threat is the awareness of imminent compre- 
hensive change in oneself. Definitions: Compre- 
hensive change—Employed in the context of 
threat, it refers to a modification in the 
directionality of evolving psychologica! proc- 
esses. Restated, it refers to the change in the 
direction of change. Not all psychological 
change implies a modification in directionality. 

Self-movement, the independent variable, 
was chosen as one measure of an individual’s 
evolving psychological processes. Self-move- 
ment refers to the direction of change in self- 
conceptualization, inferred from the different 
ways an individual describes himself (a) as he 
was in the past, (6) as he is now, (c) as he 
would like to be in the future. The perception 
of threatening and nonthreatening acquaint- 


‘This paper is based upon a doctoral dissertation 
completed in 1951 at The Ohio State University under 
the direction of Dr. George A. Kelly. 
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ances, in terms of roles and expectancies, 
constituted the dependent variable. Defi- 
nitions: Role—the way an individual rates 
another person. Expectancy—the way an 
individual thinks another person might rate 
him. Threatening acquaintance—one whom 
an individual wishes to avoid or whose 
behavior he would like to modify greatly. 
Nonthreatening acquaintance—one with whom 
an individual wishes to have contact and whose 
behavior he would not care to modify greatly. 
The description of acquaintances in terms of 
roles and expectancies is consistent with our 
theoretical framework. Roles and expectancies 
are common ways of anticipating or predicting 
other people. 

Since we have employed self-movement as 
one measure of evolving psychological proc- 
esses, threat may be conceptualized as an 
imminent change in the directionality of self- 
movement. Therefore, according to our 
theory, the threatening acquaintance described 
in terms of roles and expectancies is threatening 
because interaction with him might change the 
directionality of self-movement. Now if threat 
is related to the change in direction of’ self- 
movement, what kinds of roles and expect- 
ancies might lead to such a change? Our 
hypothetical answer to this question may be 
stated as follows: The threatening, other 
person will be perceived by an individual as 
being more like himself as he was in the past, 
or is now but no longer wants to be, than the 
nonthreatening person. Also, the threatening 
other person will be seen by an individual as 
expecting him to be more like he used to be, or 
is now but no longer wants to be, than the non- 
threatening person. The latter formulation, 
known as the expectancy hypothesis, may be 
readily illustrated. The adolescent is threat- 
ened by his parents, not merely because he feels 
he is sometimes perceived as a child, but, be- 
cause the parents seem to perceive him as a 
child, he might begin acting like a child. Because 
the parental expectancy is so plausible, the 
directionality of his self-movement toward 
maturity could easily be reversed. 

If we can show that a relationship exists 
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between the axes along which an individual 
perceives himself changing and his perceptions 
of threatening people, then we may conclude 
that our findings tend to support the more 
comprehensive formulation of threat within the 
psychology of personal constructs. 


METHOD 


A social concept test (1) and rating scales derived 
from this social concept test were employed as research 
tools. In the social concept technique, a subject (S) 
writes the names of certain specified persons he knows, 
30 in all, on separate cards. These cards, a name on 
each one, are then dealt to S three at a time, in certain 
combinations. As three cards are dealt, S is asked, “In 
what outstanding way are two of these persons similar 
and different from the third?” From S’s responses to 
this question, rating scales may be created, scales which 
are derived from S’s descriptions of these people and are 
expressed in his own terminology. For example, S may 
state that two of the persons are “gregarious” and the 
other person is “not gregarious.” The terms “gre- 
garious” and “not gregarious” are placed at either end 
of a nine-point rating scale. It should be emphasized at 
this point that Ss do not always give opposite concepts 
in expressing a similarity and difference between three 
people. To insure opposite concepts (i. e., opposite to 
S), the examiner, when in doubt, simply asks whether 
or not the two descriptions are opposites. If the descrip- 
tions are not in opposition for S, he is asked to state 
their opposites. 

The S is then asked to place two or three persons 
with whom he is acquainted at points on each scale, 
depending upon whether he sees the scale as dichoto- 
mous or dimensional. Whenever S feels he can use only 
the poles of his scale, it is considered to be dichotomous 
and only the poles of the scale are used for ratings. Two 
referent persons then are placed on the scale, one at 
each end of the nine-point scale, and scale points 2 
through 8 are crossed out. Whenever S perceives people 
distributed across his scale, it is considered dimensional, 
and all points of the scale may be used for the ratings. 
Three referent persons then are placed on the scale, one 
at each end of the nine-point scale and one in the middle. 
In other words, scale points are represented by people. 
(See Fig. 1.) 

Both exploratory and validation designs were em- 
ployed. Fifteen adult Ss were used in the exploratory 
phase of the study. Fifty-four volunteer undergraduate 
male and female Ss, aged 18 through 24, drawn from 
economics and psychology classes at The Ohio State 
University, were used in the validation study. 

The social concept test was administered individually 
during one experimental session to each S. The 30 names 
used in this technique were specified by the experi- 
menter (£) in order to insure adequate sampling of 
persons with whom an S is acquainted. For example, 
the following people were specified: teacher you like, 
mother, father, sister nearest your age or person like a 
a sister to you, person with whom you would not care 
to go on a vacation; etc. 

The rating scales derived from S’s responses to the 
concept test were administered individually during a 
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second experimental session, at which time S was asked 
to: (a) rate himself on his own scales as he would like 
to be six years from now, if it were possible for him to be 
anything he wished; (5) rate himself as he feels he is 
now; (c) rate himself as he feels he was at some period 
during the past six years when he was most different 
from the way he is now; (d) think of a threatening and 
a nonthreatening person and rate each of them in terms 
ot roles and expectancies. Threatening and nonthreaten- 
ing persons were of the same sex as S. 

As can be seen in Fig. 1, all ratings were done on a 
rating sheet and the rating scale labels were placed on 
a separate overlay sheet. The S places the right-hand 
edge of the first overlay sheet to the left of the first 
column, and using the name designations as reference 
points, rates the first six scales by encircling the proper 
numbers in the first column on the rating sheet. The S 
then takes the second overlay sheet with a different 
six scales and the second rating sheet and continues 
rating in this first column. After all scales have been 
rated in the first column, S takes the first overlay sheet 
again, moves it one column to the right to the second 
column, thereby covering over the first column, and 
proceeds as before. This procedure is continued until S 
has completed all columns. 


SELF-MOVEMENT 


= Employing the role and expectancy descrip- 
tions of the threatening and nonthreatening 
persons as dependent variables, we attempted 
to predict these descriptions from the direc- 
tionality of self-movements, inferred from the 
change in self-description from past, to present, 
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to future. A self-movement is inferred to have 
occurred within a single concept scale when 
there is a numerical difference between the 
past and present or present and future ideal 
self-ratings. It should be noted that the 
movement shown in Fig. 1 is from right to 
left. Our use of the overlay sheet necessitated 
this arrangement. 

Self-movement may occur between the past 
and the present, directionality up or down; 
between the present and the future ideal, 
directionality up or down; and between the 
past and future ideal, directionality up, down 
or up and down (i.e., the movement between 
both past and present and present and future 
ideal may be up. It also may be down. Then 
again, one movement may be up and the other 
down. The latter is called bidirectional 
movement). (See Fig. 2.) 

The independent variable more specifically 
was a unidirectional self-movement within a 
concept scale. Where the movement from past 
to present to future ideal is unidirectional, 
prediction simply is made from the direc- 
tionality of this unidirectional movement. 
(See Fig. 3.) 

Where the movement from past to present 
to future ideal is bidirectional, prediction is 
made from the longest unidirectional move- 
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ment. (The frequency with which bidirectional 
movement occurred was small. Movement 
usually occurred only from past to present, 
only from present to future, or in the same 
direction from past to present to future.) (See 
Fig. 4.) 

In the few cases in which the two opposite 
movements are the same in length, prediction 
is made from the last movement (Fig. 4). 

As we previously stated, the threatening and 
nonthreatening other persons were described in 
terms of roles and expectancies. These de- 
scriptions, expressed as numerical ratings, were 
predicted from the direction of a unidirectional 
self-movement, relative to each concept scale. 
In other words, a set of predictions was made 
for every concept scale within an S’s scale 
repertoire in which movement occurred. 


EXPERIMENTAL HYPOTHESES 


Role hypothesis. The Ss will attribute the 
role rating nearest the end of the concept 
scale toward which the unidirectional self- 
movement is pointing more often to the non- 
threatening person than to the threatening 
person. 

Expectancy hypothesis. The Ss will attribute 
the expectancy rating nearest the end of the 
concept scale toward which the unidirectional 
self-movement is pointing more often to the 
nonthreatening person than to the threatening 
person. 

The following example may clarify these 
hypotheses. An S has created 15 concept 
scales within which self-movement may be 
inferred. If the direction of movement suc- 
cessfully predicts the role ratings in ten scales 
and fails to predict the role ratings in five 
scales, it may be inferred that successful 








532 


predictions exceed unsuccessful predictions, 
and accordingly, the S conforms to the hy- 
pothesis. However, a significant number of 
Ss must follow the same pattern befo1 > the role 
hypothesis is supported. 


RESULTS 


Forty-eight Ss attributed the role rating 
nearest the movement-designated end of the 
scale more often to the nonthreatening person. 
Three subjects attributed the role rating more 
often to the threatening person. In three cases 
successful predictions equaled unsuccessful 
predictions. These latter cases were handled 
statistically by arbitrarily placing one and one- 
half cases with the successful prediction group 
and one and one-half cases with the un- 
successful prediction group. Chi square, with a 
50:50 theoretical frequency and 1 degree of 
freedom, was employed in the analysis. The 
statistical results (x? = 37.5) allow the null 
hypothesis to be rejected at better than the 
.001 level of confidence. 

Forty-nine Ss attributed the expectancy 


rating nearest the movement-designated end of 
the scale more often to the nonthreatening 
person. One S attributed the expectancy rating 
more often to the threatening person. In four 


cases, successful predictions equaled un- 
successful predictions. These latter cases were 
handled statistically by arbitrarily placing two 
cases with the successful prediction group and 
two cases with the unsuccessful prediction 
group. Chi square again was employed. The 
statistical results (x? = 42.7) allow the null 
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hypothesis to be rejected at better than the 
.001 level of confidence. 


SUMMARY AND CONCLUSIONS 


The results of this study support the 
hypothesis that a highly significant relation- 
ship exists between the axes along which an 
individual perceives himself changing and his 
perceptions of threatening people. More 
explicitly stated, the threatening person is 
perceived by an individual as being more like 
himself as he was in the past, or is now but 
no longer wants to be, than the nonthreatening 
person. Also, the threatening person is 
perceived by an individual as expecting him to 
be more like he used to be, or is now but no 
longer wants to be, than the nonthreatening 
person. Within our theoretical framework, 
these findings may be interpreted further to 
mean that the threatening person is placed by 
the threatened individual in the unique 
position of being able to reverse the individual’s 
direction of movement, to interrupt his 
movement, or at least to place him under 
additional pressure to resist such modification 
of his movement. 
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THE COLOR-FORM RESPONSES OF NORMAL, PSYCHOTIC, 
AND NEUROTIC SUBJECTS 


J. D. KEEHN! 
Maudsley Hospital, London 


NVESTIGATIONS into the color-form atti- 
tudes of various special and clinical 
groups have been reported by several 

writers since Kuhlmann’s (17) study of defec- 
tives in 1904. In addition to the large number 
of studies of normal adults and children (see 
Lindberg [18], Eysenck [7, 8], for reviews), 
there have been investigations of the color- 
form reactions of defectives (2), organics (11), 
spastic children (6), epileptics (1), psychotics 
(3, 4, 9, 15, 18), and neurotic patients (5, 7, 19). 

Altogether some seven different types of 
color-form tests have been employed by these 
workers. Time of sorting by color or form was 
used by Kuhlmann (17) and by Eysenck (9), 
Kibler used differential memory of tachisto- 
scopically exposed colored shapes, and Braat 
(3) instructed his subjects (Ss) to match a 
given figure with a set of figures shown tachis- 
toscopically. Grouping of symbols by color 
or shape formed the basis of tests employed by 
Goldstein and Scheerer (11), Cotton (6) and, 
in a modified form, by Eysenck (7). A test 
involving the ranking first of a set of colors, 
then a set of shapes, and third a selection of 
colored shapes—such that the most preferred 
shape appeared in the least preferred color— 
was given to a group of neurotics by Eysenck 
(7), and sorting tests were devised and adminis- 
tered by Lindberg (18), Clarke (5), and 
Eysenck (9). Finally the Ring test, designed 
and described by Lindberg (18), was used by 
Lindberg (18, 19), Campailla (4), Eysenck 
(7), and Clarke (5). 

It was not until Eysenck (7) reported that 
the average intercorrelation between the three 
tests used by him was only .24 that the sug- 
gestion that the various color-form tests might 
not all be measuring the same thing was put 
forward. Thus Eysenck speculated that either 
the tests used by him were all measuring fairly 
specific attitudes or that if a general factor of 


1 The author wishes to express his gratitude to Dr. 
H. J. Eysenck at whose suggestion and with whose ad- 
vice and encouragement this study was carried out. 
This work was done while the author was a London 
University Postgraduate Scholar at the Institute of 
Psychiatry. 


color-form attitude existed, his particular 
tests might be rather poor measures of it. 

Further doubts about the comparability of 
color-form tests arise from a study of the 
results obtained from the application of such 
tests to psychotic Ss. Thus while schizophrenics 
appear able to abstract color from form or 
form from color more easily than manic- 
depressives in the tachistoscopic memory 
test (15), no significant differences between 
the attitudes to color or form of the two groups 
were found on the sorting test employed by 
Eysenck (9). Hence the tests seemed more 
likely to be measuring the ability to dissociate 
than to be indicative of attitude to color and 
form when used with these groups. However, 
this interpretation has not been extended to 
normal subjects, for Kretschmer and _ his 
associates (8) have used response to color and 
form as a distinguishing feature between 
schizothymic and cyclothymic personality 
types which are posited to be the normal proto- 
types of the more extreme psychoses. 

In order to clarify the situation somewhat, 
a factor analysis of a number of color-form 
tests was carried out by Keehn (13, 14). A 
factor of color-form attitude was identified, 
but although this pointed to the comparability 
of the earlier tests in terms of method, the 
correlations between the tests were low enough 
to show that results obtained from the use of 
one test were of little value in predicting 
response to others. Hence it was felt (14) that 
a battery of tests based on the results of the 
factor analysis would give more valid and 
reliable results than those obtainable from 
the use of one test alone. 


Tue Test BATTERY 


From a large battery of color-form tests (13), nine 
were selected for special analysis (14), and it is these 
tests which constitute the present battery. The number 
beside each test refers to its seria! position in the larger 
battery (13). 


2. Tossing. A disc was tossed like a coin and S could 
respond to color or shape in guessing which side would 
fall uppermost. 

4. Memory: paired associates. Four capital letters 
had to be associated with four colored shapes. Colors 
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and shapes were later interchanged to see which cate- 
gory had been associated with the letters. 

8. Grouping. Twelve squares set out on a background 
could be grouped either by color or by disposition. 

10. Odd one out. Of nine figures on a card one was a 
lone shape and one a lone color. The subject was asked 
to pick out the odd figure. 

12. Stepping stones. The subject had to pick his way 
across a set of colored shapes from one point to another. 
Two correct paths were available, one where all the 
shapes were the same, the other where the colors were 
the same. 

13. Common elements. Three similar figures had to be 
chosen from eight. “Similarity” could be based on 
color or shape. 

19. Weigl sorting. Nine wooden blocks could be 
sorted by color or shape. 

22. Card sorting. A pack of cards was to be sorted 
into four compartments. All but four of the cards could 
be sorted in only one way; the remaining four could be 
placed either by color or by shape. 

24b. Tachistoscopic matching. A given figure had to 
be selected from eight figures exposed tachistoscopi- 
cally. Selection could be by color or form. No figure 
exactly matched the original. 


The factorial analysis of these tests yielded two 
factors, the test saturations being reproduced in Table 1. 
The first factor was identified as that of attitude toward 
color or form. The obtained first factor saturations 
were used as though they were correlations of the tests 
with the factor (21), and regression weights for maxi- 
mum predictability of the color-form factor by the 
battery were calculated according to the Ledermann 
method as set out in Thomson (21). The resulting 
weights are shown in Table 2. The ensuing multiple 
correlation of the battery with the factor was found to 
be .93 thus showing the battery, weighted in this fash- 
ion, to be a highly valid measure of the factor. 

In order to obtain an estimate of the reliability of 
the battery, use was made of the generalized Spear- 
Kr 
1 + (K-1)r 
as set out in Gulliksen (12), where rzis the estimated 
reliability of a total test whose inter-unit correlation is 
r and where the unit test is increased in length K times. 
This formula rests on the assumptions that all the units 
are psychologicaliy similar and that the correlations 


man-Brown prophecy formula rr 


TABLE 1 
Factor SATURATION 


33.50.38 .60).57 
— .46|— .43|.45).21). 


TABLE 2 
REGRESSION WEIGHTS 


Test 2; 4 | 8); 10 12, 13 


19; 22 24b 


Weight .20 .30 |.06 .21 |.19 .18 .15,.31 .09 
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TABLE 3 


PERCENTAGE OF THE Various Groups GIVING THE 
Cotor RESPONSE TO THE INDIVIDUAL TESTS 
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between them are all equal. In this case each individual 
test was taken as a unit and, as all nine tests have been 
shown to measure along a common dimension, the 
assumption of psychological similarity is not unreason- 
able. However, the correlations between the items 
varied between —.18 and +.44 with a mean of +.21, 
so that although the average of these correlations was 
used in the formula, the obtained reliability estimate 
of .71 must not be regarded as anything more than a 
general indication of the reliability of the battery 


Administration 


Although attention was primarily centered upon the 
nine tests forming the color-form battery, all the tests 
described by Keehn (13) were given to 25 schizophrenic, 
20 manic-depressive, and 25 neurotic males whose ages 
ranged from 19 to 39 years. The reason for including 
the extra tests was twofold. First, as the original normal 
group had received the whole battery it was deemed 
wiser to repeat this procedure with the patients rather 
than risk the introduction of an extra relevant variable 
and, second, it was anticipated that some of the extra 
tests would prove useful in differentiating between the 
groups for reasons other than their color content. 


RESULTS 

Table 3, by showing the percentage of Ss 
giving the color response to the individual 
tests, exhibits the sort of information pre- 
sented by the earlier studies. Similar per- 
centages obtained from a group of 200 normal 
males are also included. However, the main 
interest here is not centered upon the in- 
dividual tests but upon the nine-item color- 
form test battery as a whole. Consequently, 
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Table 4 shows means and standard deviations 
for the groups where each test was given its 
regression weight (see Table 2) and each 
person’s score was the algebraic sum of his 
weighted scores on all the nine tests. A positive 
score was given for a color response and a 
negative one was given for a form response. 
Again the results from the 200 normals are 
included. 








TABLE 4 
Group MEANS AND SD’s ON THE COLOR-FORM 
BATTERY* 

Group N Mean SD 
Normals 200 —27.4 81.28 
Schizophrenics 25 43.5 98 .40 
Manic-depressives 20 —2.5 90.53 
Neurotics 25 —28.5 


76.53 


* Positive scores show a preponderance of form responses 








DISCUSSION 


Normals vs. neurolics. There is widespread 
feeling among psychologists (10, 20), especially 
those having experience with the Rorschach 
test, that reaction to color is somehow linked 
with emotional instability. Consequently, 
since Eysenck (7), Clarke (5), and others have 
linked neuroticism with lack of emotional 
control, it seemed reasonable to expect neu- 
rotics to show greater reaction to color than 
normals. This was shown not to be so on the 
nine-test battery; neurotics reacted slightly 
but insignificantly less to color than the nor- 
mals. 

Only one of the individual tests, number 16, 
differentiated between the two groups. As 
this was at the .05 level, and as at least one 
such differentiation would be expected by 
chance, no significance was attached to this 
result. 

Schizophrenics vs. manic-depressives. Dif- 
ferential reaction to color and form is one of 
the bases of Kretschmer’s (16) typological 
system, and many studies carried out by his 
school (see Eysenck [8] for summary) suggest 
strongly that manic-depressives react more to 
color than schizophrenics. However, not only 
is this not supported by the scores of these 
groups on the present battery, but the dif- 
ference, although statistically nonsignificant, 
is decidedly in the reverse direction. 

Examination of the individual tests shows 
four which do differentiate between the two 
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groups. Test 25a, a comparison of reaction 
time to color with that to shape, and test 24 
separated the groups at the .001 level, and in 
both cases the manic-depressives obtained the 
higher color scores. Yet tests 8 and 19, dif- 
ferentiating between the groups at the .01 
and .05 levels respectively, show the schizo- 
phrenics to react more to color. Clearly, then, 
the factor distinguishing between the groups 
is not that of reaction to color, especially as 
the three tests appearing in the color-form 
battery have the lowest weights in that bat- 
tery. 

One possible explanation of these anomalous 
findings would be that manic-depressives 
have a lower threshold for perceiving color 
in brief exposure tests (numbers 245 and 25a) 
but react to form when time for deliberation 
is allowed, as in tests 8 and 19 where the 
response is made with the test material still 
visible to S. However, the dangers besetting 
phenomenological interpretations of these 
tests will be apparent and no suggestion 
without considerable experimental backing is 
worth taking seriously. 

Psychotics vs. normals. On the color-form 
test battery comparison of the normal and 
schizophrenic groups shows the latter to be 
significantly more color reactive beyond the 
.001 level of confidence. The manic-depressives 
fall between but are not significantly dif- 
ferent from either group. Since the combined 
psychotic group differs significantly from the 
normal group also, it is impossible to say from 
this evidence whether the battery separates 


TABLE 5 


SIGNIFICANCES OF THE DIFFERENCES BETWEEN THE 
Various GROUPS ON THE VARIOUS TESTS 


| Normals vs. Normals vs. Normals 
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out schizophrenics in particular, the manic- 
depressives behaving on this battery similarly 
to the normals, or whether it is measuring 
psychoticism in zeneral, the two psychotic 
groups being combined. 

In the light of Eysenck’s (9) finding of a 
general factor of psychoticism but nu group 
factor separating out the conventional psy- 
chotic subgroups, the latter alternative seems 
to be the most probable. Yet in Table 5, which 
depicts those individual tests differentiating 
between the normals and the schizophrenics, 
five tests occur in the color-form battery, 
whereas only one test in the battery dis- 
tinguishes between normals and manic-de- 
pressives. 

Further examination of Table 5 shows that 
as there is no overlap between the tests dif- 
ferentiating between the respective psychotic 
subgroups and the normals, it would appear 
that the postulation of a single psychotic 
dimension is inadequate. For if schizophrenics 
and manic-depressives lay along a single 
continuum of psychoticism, tests differentiat- 
ing between the closer groups would also dif- 
ferentiate between the extreme groups. Which- 
ever psychotic group lies nearer to the normals, 
in the present study this differentiation is 
not shown. 

Furthermore, as the difference between the 
means of the schizophrenics and manic- 
depressives approaches significance (¢ = 1.6) 
even with such small numbers, it is not un- 
likely that large groups would differ signifi- 
cantly on the battery. Hence rather than 
conclude that reaction to color on the color- 
form battery is indicative of psychoticism, it 
would seem better to regard responses to the 
battery as a measure of schizophrenia. 

Finally, in the last column of Table 5, six 
individual tests occur that discriminate sig- 
nificantly between the total psychotic group 
and the normals. Since four of these tests also 
differentiate between the normals and one or 
other of the psychotic subgroups, it is likely 
that in these cases the differences are largely 
due to the effect of the relevant subgroup. 
The remaining two tests, numbers 9 and 16, 
may be real measures of psychoticism or they 
may be exhibiting some of the chance dif- 
ferences which inevitably occur when a large 
number of comparisons are made, as is the 
case in the present investigation. 


J. D. KEEHN 


SUMMARY AND CONCLUSIONS 


A battery of nine color-form tests was given 
to 25 schizophrenic, 20 manic-depressive, 
and 25 neurotic patients, and their responses 
were compared with each other and with 
those of a normal group of 200 males. Several 
other tests were administered at the same 
time.. Results showed that neurotics do not 
differ significantly from normals in their 
responses to color on the test battery, and 
that manic-depressives respond less than 
schizophrenics do to color but not significantly 
so. The schizophrenics did, however, react 
significantly more to color than did the other 
groups, and it was argued that the battery was 
measuring schizophrenia specifically rather 
than psychoticism. 

Subsidiary evidence from the individual 
tests showed that although schizophrenics 
sometimes gave significantly more color 
responses than manic-depressives, the reverse 
was true of tests involving responses to briefly 
exposed stimuli. It was therefore suggested 
that interpretation of the processes involved 
in reaction to these tests must not be based 
upon appearances. Thus although Kret- 
schmer’s school seems to be correct in its 
assertion that cyclothymes react more to 
color than schizothymes on the tachistoscopic 
type color-form tests, it does not follow that 
cyclothymes are more reactive to color in 
general. 

Finally, from the pattern of tests differen- 
tiating between the normalsand schizophrenics 
and the normals and manic-depressives re- 
spectively, it was argued that a single factor 
of psychoticism was insufficient to account 
for the variations between the groups. It is 
likely that an investigation similar to that of 
Eysenck (9) but including some of the present 
tests will result, as Eysenck suggests, in the 
extraction of a factor of schizothymia-cyclo- 
thymia. However, such an investigation would 
need to be carried out with considerable care, 
for if two orthogonal factors such as psychot- 
icism and schizothymia-cyclothymia are to 
be posited it would be incorrect to regard 
psychotic subjects as necessarily at the ex- 
tremes of the latter continuum. 
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ANXIETY AND THE MODE OF RESPONSE!:? 


ROBERT E. SILVERMAN? 
Indiana University 


ECENT studies in the area of perception 
and motivation have often produced 
data with contradictory and am- 

biguous interpretations (3, 4, 6, 8, 11, 12, 
13). One explanation for this situation may be 
that interpretations are made which are bur- 
dened with ad hoc concepts like perceptual 
defense and perceptual sensitization that even 
now show signs of being eschewed by their 
authors (12). 

It is felt that these concepts are entirely 
superfluous and arise because investigators 
fail to control the stimulus situation and to 
consider responses apart from any goal-seeking 
or drive-reducing properties. It would seem 
that a fruitful approach at this time would 
involve studying the gross responses or a 
general class of responses that persons make 
to the immediate stimulus situation. Once 
we have acquired some information at this 
level, the next step might then be concerned 
with investigating the development or ac- 
quisition of responses as a function of stimulus 
conditions. 

In attempting to treat the first problem, 
that of studying the responses, it is necessary 
to develop a method of grouping or classifying 
responses as they relate to classes of stimuli. 
The classification scheme used in the experi- 
ment reported here simply entails a considera- 
tion of responses as they relate to the relevant 
as opposed to the incidental stimuli in a con- 
trolled stimulus situation. 

Given an experimental setting, we may 
designate the subject’s (.S’s) task as it is struc- 
tured by the experimenter (£) as being com- 
posed of relevant stimuli. All the controlled 
stimuli in the situation that have no observ- 
able relationship to the relevant stimuli are 
designated as incidental. We may assume that 


This p»per is a portion of a thesis submitted to the 
Denaitment of Psychology of Indiana University in 
partial fulfillm n' ct the requirements for the degree 
of Doctor of Philosophy. The writer is indebted to Dr. 
Bertram D Cohen for his advice and assistance 
throughout the study. 

* A part of this paper was read before the Eastern 
Psychological Association at Poston in 1953. 
* Now at New York University. 
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in the experimental setting an S’s mode of 
response can involve responding to: (a) the 
relevant or task stimuli, (0) the incidental 
stimuli, (c) both, (d) neither. 

Bahrick and Fitts (2) have demonstrated 
that increasing the reward value of a central 
task interferes with performance of a periph- 
eral task. As motivation was increased Ss 
tended to respond more to the relevant as 
opposed to the incidental aspects of the 
stimulus situation. Bahrick and Fitts’s Ss 
were rewarded or not rewarded for each dis- 
crete response. Thus, the positive reinforce- 
ment for each response was explicitly related 
to the relevant task. 

However, the question arises as to the gen- 
eral conditions that will elicit relevant as op- 
posed to incidental responses. What, for ex- 
ample, will be the effect of anxiety-evoking 
variables? Does anxiety enhance the selec- 
tivity of responding as some authors have 
predicted (4, 11, 13)? Or, as a second group of 
writers have thought (5, 9, 10), do the condi- 
tions which evoke anxiety produce a general 
unselective type of responding? 

The present study was designed to inves- 
tigate the effects of anxiety—defined as the 
presentation of the threat of electric shock— 
on the mode of response of human Ss. The 
experimental situation permitted some degree 
of control over the relevant and the incidental 
stimuli, and measures of the responses to 
these were obtained. If S’s mode of response 
emphasized responding to the relevant stimuli, 
we would expect greater improvement in his 
performance on the experimental task as com- 
pared to improvement in performance of those 
Ss who respond to the incidental stimuli or 
who respond to neither the relevant nor inciden- 
tal stimuli. The Ss who respond to the inciden- 
tal stimuli would manifest more incidental 
learning than Ss who use the other modes of 
response. 

The possibility exists that responding to 
both the incidental and relevant stimuli could 
raise or lower performance with respect to 
either class of stimuli. In order to check this, 
two types of relevant tasks have been used. 
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One task was considered as being difficult and 
the second task was a very easy one. In this 
way the relationship between the relevant 
and incidental tasks was observed. 


METHOD 


Subjects. The Ss were 146 students from first-year 
psychology courses. They were assigned randomly to 
six groups. The groups each contained 25 Ss with the 
exception of groups E and F, each of which consisted of 
23 Ss. 

Apparatus. The experiment was performed in a 
sound-deadened room 8 ft. wide by 14 ft. long by 8 ft. 
high. The apparatus consisted of a response lever which 
S moved to indicate a response, a film strip projector 
which was used to present the relevant task stimuli, and 
a wire recorder and separate speaker attachment which 
were used to present the incidental stimuli. In addition, 
an inductorium was used to administer a sample shock 
to the shock-threat groups. 

The response lever was a 2-in. plastic rod projecting 
through an aperture in a 23-in. by 43-in. wooden screen. 
The rod was connected to a brass slider which in turn 
was connected to an endless loop of string that moved 
a writing pen attached to an Esterline-Angus recording 
system. The arrangement was such that when the rod 
and slider were moved 9 in. the movement was recorded 
as a 2.2-in. move on the recording tape. 

The stimuli for the relevant task were presented by a 
film strip projector connected to a timing system which 
changed the film strip frame every 10 sec. The stimuli 
were five black lines 0.5 in. in width and varying in 
height from 2.25 in. to 3.25 in. in steps of .25 in. These 
stimuli were presented on a 6 by 8-in. white screen at 
S’s eye level. The projector was behind the screen; thus 
only the screen and the response lever were visible to S. 

The incidental stimuli were 20 two-syllable words 
randomly selected from 94 spondaic words developed 
at the Psycho-Acoustic Laboratory of Harvard Univer- 
sity (7). The words used were: railroad, sidewalk, mid- 
way, whitewash, playmate, buckwheat, sundown, 
platform, lifeboat, cookbook, outside, beehive, work- 
shop, headlight, doorstep, scarecrow, wildcat, baseball, 
grandson, stopwatch. Ten repetitions of the words were 
recorded in 5-min. intervals on a wire recorder (Radio- 
technic, Model 55A) in a woman’s voice. The incidental 
stimuli were introduced by switching on the wire 
recorder which played through a muffied 6-in. speaker 
placed in the ceiling of the experimental room. The 
volume output for the recorder was set at 50. At S’s 
position, the sound-level intensity of the words was 
approximately as loud as “subdued conversational 
speech.”’ 

The sample shock was administered through an 
inductorium energized by two 1.5-v. dry-cell batteries 
in series. The primary control switch was set for con- 
tinuous vibrator operation and the output voltage was 
at maximum. 

Preliminary training and pretest. Each S in groups A, 
B, C, and D was seated at the apparatus, and the 
projector was switched on to present the training 
stimulus. The S was instructed that this was to be a 
discrimination task and that he was to learn to move the 
key when he saw the line which was now on the screen, 
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but that he was not to respond when any other line 
appeared. A brief training series followed during which 
E reinforced a correct response by saying “that’s right”’ 
and indicated a wrong response (failing to move when 
the correct line was presented or moving the response 
key when an incorrect line was presented) by stating 
“you should not move the key since that is a longer 
(shorter) line” or “you should move the pointer since 
that is the correct line.” During the training series the 
stimuli were presented in the following order: 3, 3, 3, 
3, 5, 3, 1, 3, 2, 3, 4, 3, where 3 indicates the correct 
line (2.75 in.), 1 is the longest line (3.25 in.), and 5 is the 
shortest line (2.25 in.). Following the training series 
each S received a pretest in which the lines were pre- 
sented in the following order: 3, 5, 3, 3, 3, 2, 3, 4, 3, 1, 
424,444 5 8,35,5 

An S who made more than five errors on the pretest 
was eliminated from the remainder of the experiment. 
Eleven Ss were eliminated. An error was defined as 
responding incorrectly to a line other than 3 or failing 
to respond to line 3. 

The Ss in groups E and F were exposed to a very 
easy discrimination task during the training and pre- 
test series. They were instructed to respond whenever 
a line (any line) appeared on the screen. The lines were 
the same as those used for the other groups. 

Test series. For all Ss the pretest series was followed 
by a 1-min. rest period and then a test series. In the 
test series the order of presentation was: 3, 5, 1, 2, 4, 5, 
4£1,2441254.1,4424524464645 442% 
4, 1, 4, 2, 1, 5, 3. The Ss in groups E and F again 
merely had to respond when any line appeared on the 
screen. 

The experimental conditions were introduced dur- 
ing the test series for all six groups. Group A performed 
the test series while being threatened with electric 
shock. Electrodes were attached to 5S’s left wrist and 
left ankle (the ankle electrodes were “props” in that 
they were not attached to the inductorium). Each S 
was told that he would receive an occasional electric 
shock during the test series. A sample shock was given 
before the series began, but shock was not administered 
again during the experiment. One minute after the 
presentation of the first line, the words (incidental 
stimuli) were started. The words continued until the 
final 15 sec. of the test series. Thus the words were 
presented while visual stimuli 7 through 34 were being 
presented. 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF THE 
RECALL AND RECOGNITION SCORES 


Difficult 


Easy 


B Non- E F Non- 
< 
A Shock |" shock | Shock | shock 


| 


Mean |} 6.56 12.20 | 11.09 | 19.13 
SD 6.93 11.83 | 10.44 | 11.61 
Recognition | | 
Mean | 4.31 19} 5.09 | 7.25 
SD .60 | 4.18 3.88 











TABLE 2 
ANALYSIS OF VARIANCE OF THE RECALL AND RECOGNITION SCORES 
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Recall Recognition 
Source en. © ee : riz 7 — 
| Sum of df | Mean F Sum of af | Mean 
| Squares | | Square | Squares | ~ | Square 
Difficulty of task 786.75 1 786.75 | 5.879* 20.25 1 | 20.25 | 1.010 
Shock and nonshock 1105.71 1 | 1105.71 | 8.263** 97 .35 1 | 97.35 | 4.856* 
Interaction task X Shock 34.58; 1| 34.58 .258 1 47 .023 
Within cells 12311.34 | 92 133.82 1844.25 | 92 | 20.05 
* Significant at the .05 level of confidence. . a . ied es 
** Significant at the .01 level of confidence. 
Group B performed the discrimination test under TABLE 4 


the same conditions as group A with the exception of 
the shock threat. Group C performed the discrimination 
test under the threat of shock as did group A, but this 
group was not exposed to the words. Group D was 
exposed to neither the shock threat nor the words. 
Groups E and F performed the easy discrimination. 
Group E was subjected to shock threat and exposed to 
the words during the test series. Group F received no 
threat of shock but was exposed to the words during the 
test series. 

Test for incidental learning. Following the dis- 
crimination test, Ss who had been exposed to the words 
were given a recall test and a recognition test for the 
incidental words. Recall scores were obtained by allow- 
ing full credit for a word correctly recalled, half credit 
for one syllable of the two-syllable word correctly re- 
called, and one-fourth credit for any two-syllable, 
spondaic word designated. The total score was multi- 
plied by 4 to eliminate fractions. Recognition scores 
were obtained by having Ss select words from a matrix 
containing 80 words. A score was corrected for guessing 
by subtracting from the number correct one-third the 
number of incorrect words. 


RESULTS 
Incidental learning. The mean recall and 
recognition scores for the groups exposed to 
the words are presented in Table 1. Inspection 
of these tabulated data indicates more evidence 


of incidental learning in the groups (E, F) 
which performed the easy discrimination task 


TABLE 3 
MEANS AND STANDARD DEVIATIONS OF THE 
DISCRIMINATION TEST SCORES 


Words | No Words 
Measure ay Coan, 
. | B Non- ¢@ | D Non- 
S - | ~k | 
A Shock | shock. | C Shock Peers 
Mean 27 .96 29.56 30.00 27.00 
SD 4.29 3.24 3.44 


3.21 


ANALYSIS OF VARIANCE OF THE DISCRIMINATION 
Test ScoREs 








| Sumof | 4, | Mean | 
Source } omen | of dene | 
Words 1.69 1) 1.690) .127 
Shock 12.25} 1 12.250) .928 
Words X Shock 132.25; 1 132.2509.941** 
Within cells 96 | 13.303 


/1277.12 


** Significant at the .01 level of confidence. 


than in the groups (A, B) which performed 
the more difficult discrimination task. Even 
more marked is the superiority in incidental 
learning, as evidenced by their mean scores, 
of the nonshock groups (B, F) over the shock 
groups (A, E). 

The results of the analysis of variance for the 
incidental learning scores are presented in 
Table 2.4 That the degree of difficulty of the 
discrimination task was an effective variable 
is indicated by the F ratio for the recall scores. 
However, the differences between the non- 
shock and shock conditions were statistically 
significant for both the recall and recognition 
scores. 

Discrimination task. Discrimination scores 
were obtained by tabulating the number of 
correct responses made by each S during the 
pretest and test series. An analysis of the pre- 
test data indicated that significant differences 
did not exist among the four groups. Thus it 


* The variances for the recall scores were not homo- 
geneous. Transforming the data so as to satisfy the 
assumption of homogeneity does not change the prob- 
abilities (F for shock, nonshock was 8.779; F for 
difficult, easy was 5.035). Therefore the statistical test 
for the nontransformed data is presented in Table 2. 
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was assumed that the groups were relatively 
similar with respect to initial discrimination 
ability. 

The means and SD’s for the discrimination- 
test scores are presented in Table 3. The 
analysis of variance for these data is presented 
in Table 4. 

A significant interaction effect was found 
between the experimental conditions where no 
over-all differences had been found between 
the words, no-words conditions or between the 
shock, nonshock conditions. This interaction 
would seem to indicate that the two experi- 
mental conditions, shock and words, act dif- 
ferently in combination than either of these 
conditions acts alone. The discrimination of 
Ss exposed to shock threat is slightly impaired 
when the words are present as compared to 
the no-words condition. However, this im- 
pairment cannot be attributed merely to the 
presence of the words, for a comparison of the 
words and no-words groups which are not 
exposed to shock threat reveals that the words 
condition alone improved performance. It is 
only when the words are combined with the 
threat of shock that performance is disrupted. 


DISCUSSION 


The results of the incidental learning tests 
are generally in agreement with those reported 
by Aborn (1) who used very different stimulus 
materials. His data indicate that “‘ego threat” 
decreased incidental learning as measured 
by recall and recognition scores. We might 
interpret such findings as offering support for 
the notion that threat acts to increase the 
relevant mode of response. In the study re- 
ported here, we would expect an S who 
responds predominantly to the relevant 
stimuli to show decreased incidental learning, 
as would an S who responds to neither class of 
stimuli. Responding to both classes should 
produce relatively good performance in both 
spheres since the facilitating effects of the 
words would cancel the interfering effects of 
the discrimination task. It is also possible that 
responding to some degree in the direction of 
the incidental stimuli might increase the rela- 
tive efficiency in both tasks. 

The mode of response of Ss exposed to 
shock threat may be considered as being di- 
rected toward the relevant stimuli since these 
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Ss show relatively good discrimination when 
no words are present and poor discrimination 
as well as poor incidental learning when ex- 
posed to the words. The possibility exists 
that the shock-threat Ss attempt to avoid 
listening to the words and that these avoidance 
responses in turn interfere with discrimination 
accuracy. This bit of speculation stems from 
the fact that the group (A) exposed to both 
words and shock threat shows impaired dis- 
crimination. 

The fact that shock threat elicits a relevant 
mode of response is additional evidence for 
what appears to be the selective and stereo- 
typed patterns of response often attributed 
to the chronically anxious person. Whether 
or not the condition of anxiety per se enhances 
this mode of response is problematical. How- 
ever, by relating the mode of response to the 
discrete responses investigated by Bahrick 
and Fitts (2) the acquisition of the relevant 
response can be investigated within a general 
learning framework. We may consider the 
mode of response as a learned response ac- 
quired in the same manner as any class of 
discrete responses. Accordingly, we would 
predict that persons with a history of clinical 
anxiety would in a situation similar to that 
reported above manifest a strong relevant 
mode of response.® 

In the study reported here, the relevant 
response is defined by E. The task itself cannot 
be said to have any anxiety-reducing proper- 
ties, but by responding to the structured task 
Ss are literally conforming in a miniature social 
situation. This is to be expected, since, in a 
discussion of human behavior, relevance in 
its broad sense is defined by society. Thus we 
would predict and do find that a large segment 
of the anxious patients are rigid conformists. 
The typical compulsive patient with symptoms 
of manifest anxiety is largely a relevant re- 
sponder although the rituals and compulsions 
often indulged in by these patients appear to 
be unusual. The explanation for this is that 
relevance is defined by subcultures and even 
by fantasy activity as well as by the mores of 
society at large. Thus a patient who indulges 
in a bizarre ritual is probably making his 
own interpretation of relevance. 


5 A study using test scores to establish a history of 
anxiety is in progress to test this prediction. 
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SUMMARY 


This experiment investigated the influence 
of shock threat on the responses to incidental 
auditory stimuli and relevant visual stimuli. 
The Ss were trained to perform a visual dis- 
crimination task and were then tested for 
discrimination accuracy. The Ss who per- 
formed the discrimination test in the presence 
of incidental auditory stimuli were tested for 
recall and recognition of these stimuli. 

The results indicated that Ss exposed to 
the threat of electric shock manifested less 
incidental learning than Ss not exposed to 
shock threat. The shock-threat condition in 
the presence of the incidental stimuli impaired 
discrimination, but the shock-threat condition 
alone improved the accuracy of discrimina- 
tion. 

The results were interpreted as lending 
support to the clinical data that show anxious 
persons to be relevant responders or conform- 
ists. 
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AN ANALYSIS OF THE CATTELL-LUBORSKY HUMOR TEST 
INTO HOMOGENEOUS SCALES 


JAMES K. YARNOLD anp MARVIN H. BERKELEY 
Human Resources Research Center, Lackland Air Force Base 


r HE ANALYSIS of reactions to various 
types of humor stimuli has sometimes 
been regarded as a potentially valuable 

means of personality measurement. The prob- 
lem of classification and interpretation of 
responses to humor, however, has been a de- 
terrent to the exploratory use of humor 
scales in personality research. 

Andrews accomplished the first large-scale 
statistical analysis of a sample of humecr 
items (2). His facter analysis of a 24-item 
matrix demonstrated that humor response 
was not of an entirely general nature as main- 
tained by certain early theorists (9). The item 
intercorrelation range (—.10 to .59, tetra- 
choric) indicated the magnitude of covariation. 
In addition, his study showed that the specific 
item structure, whether it be cartoon, joke, 
or limerick, was not of primary importance. 
While this latter information is certainly 
important, it should be pointed out that 
Andrews’ factors would have little utility as 
actual measures of humor response. 

Similarly, the research of Cattell and others 
has yielded little in terms of sufficiently homo- 
geneous and independent measurement de- 
vices. The first clusters (Cattell and Luborsky 
[4]) were developed with a too lenient criterion 
for including an item in a cluster, resulting in 
measures with low reliabilities. A later study by 
Cattell, Luborsky, and Hicks (6) was con- 
cerned with the factor analysis of a sample of 
humor items. The factor tests resulting from 
this analysis also fail to meet the minimum 
qualifications for an adequate set of scales. 
Moreover, their psychological meaning appears 
overly complex. The median reliability (Kuder- 
Richardson Formula 20) is .29, the coefficients 
ranging from .18 to .51. Although the median 
intercorrelation between factor tests is only 
17, the limiting effect of the very low reli- 
abilities suggests that this degree of indepen- 
dence may be spurious. 

On the basis of the scaling deficiencies of 
the humor clusters and factors derived from 
the three major attempts at empirical clas- 


sification of humor items, it is apparent that 
no adequate measures of humor response yet 
exist. The task of the project described here 
has been to derive an unspecified number of 
homogeneous scales from a _ heterogeneous 
pool of humor items. 


RESEARCH DESIGN 


The research design for this task was organ- 
ized as follows: (a) selection of first item set, (d) 
selection of subject sample, (c) test administra- 
tion, (d) analysis into tentative scales, (e) selec- 
tion of second item set, (f) analysis into final 
scales, (g) comparison of the tentative and final 
tests, and (k) psychological interpretations of 
final scales. Each of these phases of the research 
design is discussed in the above order. 


Selection of First Item Set 


It is obvious that the adequacy of coverage of the 
personality domain is limited by the coverage of the 
items selected for analysis. Cattell and Luborsky had 
selected the items for their C-L Humor Test, Forms A 
and B, from an extensive pool of 2,000 jokes and limer- 
icks. It was decided to capitalize on their previous work 
by selecting as the initial set one of the two forms of 
their test. Form A (91 items) required the forced-choice 
selection of the funnier of two jokes. Form B (112 items) 
required a response of funny or dull for each item. The 
first approximation of homogeneous scales was based 
on the collection of the more highly intercorrelated 
clusters developed by Cattell and Luborsky (5). Since 
Form A clusters (forced-choice items) presented many 
more interpretative problems than Form B clusters, 
items of the latter form were selected as the initial item 


pool. 
Selection of Subject Sample 


The sample on which the Form B homogeneous keys 
were developed consisted of 500 completed answer 
sheets of officer candidates at Lackland Air Force 
Base. The test was administered while most of the 
officer candidates were in their second week of training. 
In order to facilitate computational procedures, the 
sample was decreased to 500 cases from a number 
slightly over that amount, and stratified on the basis 
of sex. 

The cross-validation sample used to test the stability 
of the Form B keys consisted of 293 officer candidates of 
a subsequent class. The proportion of males and females 
as well as other OCS selection procedures remained 
unchanged for the cross-validation sample. 
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TABLE 1 


PRopUCT-MOMENT INTERCORRELATIONS AND DISTRIBUTIONAL STATISTICS OF THE FINAL HomMOGENEOUS TESTS 
WITH AN INDEPENDENT SAMPLE 


(Sample: 230 Students, OCS Class 53-A) 


RF, RG, RG; R S 


— .09 31 ‘ .19 

27 ll AE .23 
.26 

33 

27 


* Kuder-Richardson Formula 20. 


Test Administration 


The Form B item set was administered in testing 
rooms containing approximately 40 subjects (Ss) each. 
The Ss were told that their “sense of humor” was being 
measured. (Cattell and Luborsky had labeled Form B 
as “A Humor Rating Quiz.”) An unlimited amount of 
time was allowed for test completion, although Ss were 
asked to record their immediate opinion of items. In 
order to reduce social facilitation, Ss were requested to 
inhibit audible laughter. However, in all testing groups, 
a few Ss found it impossible to comply with this request. 
Since Ss answered items at different rates of speed, it 
seems unlikely that individual item responses were 
seriously affected. 


Analysis into Tentative Scales 


A direct and economical means of developing 
independent tests that have an adequate degree of 
discriminating power has been suggested by DuBois, 
Loevinger, and Gleser (8). This technique is known as 
maximizing the discriminating power of a test. This 
method avoids a number of difficulties inherent in the 
application of factor analytic techniques to items and 
also appears to lack the principal limitations of other 
methods of maximizing test homogeneity (8, pp. 2-6). 
This technique was thus selected for clustering the set 
of humor items into homogeneous tests. The major 
steps are listed below:! 

Grouping of items into initial submatrices. It seemed 
that the later clusters developed by Cattell and Lubor- 
sky (5) might possibly be better than purely a priori 
groupings of items. Thus, Cattell-Luborsky cluster 
scores for OCS Class 51-A were intercorrelated, and on 
the basis of the intercorrelations, the C-L clusters were 





1 The details of the procedures and the results have 
been filed with the American Documentation Institute. 
Order document No. 4286 from the ADI Auxiliary 
Publications Project, Photoduplication Service, Li- 
brary of Congress, Washington 25, D. C., remitting in 
advance $1.75 for 35 mm. microfilm or $2.50 for photo- 
prints. Make checks payable to Chief, Photoduplication 
Service, Library of Congress. 
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grouped into three matrices containing about 40 items 
each. 

Analysis into tentative scales. Five tentative scales 
were developed by the DuBois, Loevinger, and Gleser 
method for maximizing the discriminating power of a 
test. These scales and others, subsequently derived, will 
henceforth be referred to by the letter of the matrix 
from which they were evolved and asubscript correspond- 
ing to the order of evolvement. The tentative scales 
were labeled D,, Fe, F;, Gi, and Ge. 

Selection of second item set. The second item set from 
which the tentative scales were to be expanded into 
final scales consisted of the 91 forced-choice items of 
Form A of the C-L Humor Test. However, Ss were not 
required to choose one of two paired jokes, but instead 
made an independent rating of funny or dull for each 
joke. This resulted in a pool of 182 items which, along 
with Form B, was administered to a new sample of 400 
officer candidates. 

Analysis into final scales. It was the purpose of this 
phase of the research to increase the discriminating 
power of the tentative scales and to develop additional 
scales from items of a new sample. In this analysis four 
of the five tentative scales were retained and increased 
in length. These scales were relabeled RD;, RF2, RG, 
RGz, the R standing for revised. Three additional scales 
were evolved with labels R, S, and T. The seven scales 
constituted the final homogeneous tests. Their inter- 
correlations and distributional statistics are given in 
Table 1, for an independent sample of 230 officer 
candidates. 


Psychological Interpretations of Final Scales 


On the basis of a number of experiments Sears (14) 
concluded that a joke may be analyzed into two central 
elements: thematic and schematic. The schematic 
element is the technical structure or form of the joke, 
while the thema “constitutes the meaningful human 
content, or plot, of a joke.” It is the joke thema “which 
evckes the asocial or repressed needs, principally 
Aggression, Superiority, and Sex. Such a joke is the 
imaginal presentation of a thema which provides— 
suddenly and gratuitously—the end situation of one of 
the subject’s needs, fused with n Play. The mirth- 
thema, then, brings added delight by satisfying in 
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fantasy some purely egocentric and usually hostile wish. 
The delight is not inhibited because it can escape, 
disguised by the playful social feelings engendered by 
the mirth-schema in which the mirth-thema is em- 
bedded” (13, p. 584). 


A tentative description within the Sears 
frame of reference of each of the final homo- 
geneous scales is presented below. A typical 
item is presented with each test for illustrative 
purposes. 

RD,. The element common to the items of 
RD; is that of sexual play or sexual need. 
Sexual fulfillment is accepted as an integral 
part of life. However, this need for sexual ful- 
fillment results in frustrations which are not 
necessarily resolved by “conventional”’ social 
mores. 

It seems reasonable to hypothesize that RD; 
will be highly weighted on the following factors 
or scales: “Social Introversion” (Guilford & 
Guilford [11]), “Adventurous Cyclothymia 
vs. Inherent Withdrawn Schizothymia” (Cat- 
tell [3]), and “Social Introversion” (Mosier, 
{12]). 


Example: The census taker viewed Mrs. Jones and the 
six tots of varying ages around her with a puzzled 
frown. He seemed particularly interested in a squirming 
infant in her arms. “I don’t quite understand you,” he 
admitted. “Did you say that your husband died six 


years ago?” “Yes, sir,” she replied emphatically. “He 


died, but I didn’t.” 


RF;. The items of this test have in common 
the display of active self-assertion by an in- 
dependent, dominant, and somewhat uncon- 
ventional individual. 

It is hypothesized that RF; affords a subtle 
measure of the “Dominance” factor reported 
by Cattell (3), the “Lack of Agreeableness” 
factor of Guilford and Guilford (11), or in- 
versely, the “Sociable” factor of Thurstone 
(15). 


Example: One prehistoric man to another: “Now that 
we’ve learned to communicate with each other—shut 
up! 


RG;. In all items of this test, one detects 
the presence of a scheming, manipulative 
individual who is directing the action of his 
associates and taking full advantage of all 
opportunities to assert himself. This assertion 
usually takes aggressive form of the extra- 
punitive type. In some cases the aggression 
is explicit, as in a hard, cutting retort to a 
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rather stupid, dull, or otherwise uncalled for 
remark by an overly inquisitive, annoying, or 
generally dull character. In other items the 
aggression is implicit, as in the case of the 
clever, rather malicious opening statement 
designed to allow the replying individual to 
make himself appear ridiculous. In all cases 
the directing influence of a shrewd, exploita- 
tive, and somewhat aggressive individual is 
evident. 

lt is suggested that RG; may be related to 
the variance which the 16PF Questionnaire 
Factor L is purported to measure, “Paranoid 
Schizothymia vs. Trustful Accessibility” (5). 


Example: A famous musician once asked a man why he 
forever sang one tune. 

“Because it haunts me,” replied the man. 

“No wonder,” said the musician, “you are con- 
tinually murdering it.” 


’ 


RG,. The items of RG» are characterized by 
a common element consisting of a general 
lack of social insight and sophistication on the 
part of one character. In nearly all cases a 
naive individual (usually juvenile) makes a 
statement which has the effect of upsetting 
the equilibrium or poise of the other individual. 


Example: Willie: ‘Papa, why don’t you go away so I 
can see Mama dump that salesman off the fire escape?” 
Papa: “Why, Willie, what on earth are you talking 
about?” 
Willie: ‘Well, I heard her tell him that she’d tip 
him off as soon as you left.” 


R. The central thema of this test deals with 
the ridicule of authority figures (i.e., judges, 
ministers, doctors, teachers, parents, etc.). 

It is hypothesized that individuals obtaining 
a low score on Test R are those who identify 
with the ridiculed authority figure and are 
prototypical of the authoritarian personality 
(Adorno et al [1}). 


Example: “I rise for information,” said a member of the 
legislative body. 

“T’m very glad to hear it,” said a bystander, “for no 
man needs it more.” 


S. The central element of Test S appears 
to be a passive acceptance of inferior or 
socially disapproved behavior by the leading 
character. In each case the individual reveals 
his own intellectual or moral inertia. There is 
no attempt to alter behavior in a more desir- 
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able direction. Actually, the principal character 
often seems satisfied with his resignation. The 
general implication is that the individual is 
too weak to better himself and so merely 
accepts his intellectual or moral inferiority. 


Example: An intoxicated man, walking along the street, 
regarded the moon with much contempt. “You needn’t 
feel so proud,” he said, “you are full only once a month, 
and I am full every night.” 


T. The humor of this test seems to have its 
basis in the slightly cynical realism displayed 
by the principal characters. It is hypothesized 
that high scorers on Test T will be those in- 
dividuals who are sensitized to the foibles of 
men or women in everyday real-life situations. 
The common element of Test T is one which 
deals with the contrast between human pre- 
tension and performance. It appears that this 
test will relate co the emotional stability vs. 
general neuroticism factor suggested by 
Eysenck (10) and Cattell (3), and to the lack 
of objectivity factor as suggested by Guilford 
(11). 


Example: “Did he take his misfortunes like a man?” 
“Precisely. He laid the blame on his wife.” 


CONCLUSIONS 


The following conclusions are drawn from 
this study: 

1. An analysis of both Form A and Form B 
of the Cattell-Luborsky Humor Test by a 
recently proposed cluster method has resulted 
in the development of seven scales which 
appear to be superior, at least statistically, to 
the scales previously derived for this test. 

2. The apparent content of each of the seven 
scales has been briefly described and related 
to other factors and scales. 

3. The analysis of reactions to humor stim- 
uli does seem to provide a potentially effective 


James K. YARNOLD AND MARVIN H. BERKELEY 


and subtle means for the measurement of 
personality; however, validation is necessary. 


REFERENCES 


. Aporno, T. W., FRENKEL-BRUNSWIK, Etse, LE- 
vinson, D. J., & Sanrorp, R. N. The authoritar- 
ian personality. New York: Harper, 1950. 

. AnprEws, T. G. A factorial analysis of responses 
to the comic as a study in personality. J. gen. 
Psychol., 1943, 28, 209-224. 

. CaTTELL, R. B. Personality, a systematic theoretical 
and factual study. New York: McGraw-Hill, 
1950. 

. CatTrett, R. B., & Luporsxy, L. B. Personality 
factors in response to humor. J. abnorm. soc. 
Psychol., 1947, 42, 402-421. 

. Catret, R. B., & Lusorsky, L. B. Handbook for 
the C-L Humor Test of Personality. Champaign, 
Ill.: Institute for Personality and Ability Testing, 
1947. 

. CaTTeLL, R. B., Luporsxy, L. B., & Hicks, V. 
Revised handbook for the C-L Humor Test of 
Personality. Champaign, Ill.: Institute for 
Personality and Ability Testing, 1950. 

. Catrett, R. B., Saunpers, D. R., & Stice, G. 
Handbook for the 16 PF Test. Champaign, II: 
Institute for Personality and Ability Testing, 
1951. 

. DuBors, P. H., Loevincer, Jane, & GLESER, 
GoxtptnE C. The construction of homogeneous 
keys for a biographical inventory. USAF, Hum. 
Resour. Res. Cent., Res. Bull., 1952, No. 52-18. 

. Eastman, M. The sense of humor. New York: 
Scribner’s, 1921. 

. Eysenck, H. J. Dimensions of personality. London: 
Kegan Paul, 1947. 

. GuriForp, J. P., & GumFrorp, Ruts B. Personality 
factors S, E, and M, and their measurement. J. 
Psychol., 1936, 2, 109-127. 

. Moster, C. I. A factor analysis of certain neurotic 
tendencies. Psychometrika, 1937, 2, 263-286. 

. Murray, H. A. Explorations in personality. New 
York: Oxford Univer. Press, 1938. 

. SEARS, R. N. Dynamic factors in the psychology of 
humor. Unpublished doctor’s dissertation, Har- 
vard Univer., 1934. 

15. Tuurstone, L. L. The dimensions of temperament. 
Psychometrika, 1951, 16, 11-20. 


Received July 6, 1953. 





SOME EFFECTS OF UNEQUAL DISTRIBUTION OF INFORMATION 
UPON GROUP PERFORMANCE IN VARIOUS 
COMMUNICATION NETS'! 

MARVIN E. SHAW? 


University of Wisconsin® 


NCE Bavelas (1) first presented a mathe- 
matical model for group structures (or 
communication nets), several studies 

(2, 4, 5) have been concerned with finding 
behavioral correlates of the various measures 
which the model generates. The measure that 
has so far proved most significant is centrality. 
An index of centrality for any particular group 
structure is obtained by first counting the 
minimum number of links (communication 
channels) which must be crossed in order for 
a given position in the net to communicate 
with every other position in the net. This 
count is called the sum of distances for that 
position. Individual centrality is then com- 
puted by dividing the total sum of distances 
for the entire net by the sum of distances for 
the individual position, and net centrality is 
computed by summing the individual centrali- 
ties in the net. 

Centrality, according to Leavitt (5), cor- 
relates well with the behavior of people in 
different kinds of structures. He required 
groups to solve simple problems and compared 
their performance in different nets. Centrality 
accounted reasonably well for differences in 
accuracy, total activity, satisfaction of group 
members, emergence of a leader, and organiza- 
tion of the group. In Leavitt’s study, however, 
the information from which the group solved 
its problems was distributed equally among 
members of the group. It is a reasonable 
hypothesis that distribution of information, 
as well as centrality, may be an important 
variable in group behavior. The purpose of 


! This research was supported in whole or in part by 
the United States Air Force under contract AF 18(600)- 
340, monitored by the Human Resources Research 
Institute, Maxwell Air Force Base, Alabama. 

2 The present paper represents a portion of the dis- 
sertation presented in partial fulfillment of the require- 
ments for the degree of Doctor of Philosophy at the 
University of Wisconsin, 1953. The writer wishes to 
express his appreciation to Dr. J. C. Gilchrist, under 
whose supervision this investigation was conducted, 
for his guidance, helpful suggestions and constructive 
criticisms in every phase of the investigation. 

3 Now at The Johns Hopkins University. 


the present study is to examine such a hy- 
pothesis. 

The reasoning was as follows: centrality 
correlates with problem-solving behavior 
because the greater the centrality of a position 
the more information is available to the 
individual in that position. If this be so, then 
distributing information unequally in the 
first place by giving one position more informa- 
tion than another ought to have the same 
consequences as increasing the centrality of 
that position.‘ Therefore, an experiment was 
designed in which centrality could be studied 
simultaneously with information distribution 
in order to evaluate the relative contributions 
and interactions of the two variables. 


METHOD 


Apparatus and materials. In the present study three 
four-cell nets were selected for investigation: the wheel, 
the slash, and the circle. These nets are represented 
diagrammatically in Fig. 1, and their chief characteris- 
tics are given in Table 1. These nets were selected be- 
cause the net centralities progress in equal steps from 
16.0 for the circle to 16.8 for the wheel 

The apparatus consisted of a room which was parti- 
tioned into four cubicles. Each cubicle was connected 
to every other cubicle by means of a slot through which 
message cards could be passed. These slots were fitted 
with overlapping cloth “curtains” so that it was impos- 
sible for one subject (.S) to see into another’s cubicle. 
The frames supporting these curtains were painted to 
indicate the cubicle into which the slot opened. Each 
cubicle was supplied with an electrical switch which, 
when thrown, turned out a light on £’s signal panel. 

The following is an example of the type of problem 
used: 

Statement of problem: 

A small company is moving from one office 
building to another. It must move four kinds of 
equipment: (1) Chairs, (2) Desks, (3) Filing 
Cabinets, and (4) Typewriters. How many 
trucks are needed to make the move in one 
trip? 





*The assumption that increased information is 
equivalent to increased centrality is undoubtedly an 
oversimplification of a rather complex process. Altering 
the proportions of the total information assigned to the 
various positions almost certainly changes other charac- 
teristics of the net as well as centrality. However, this 
oversimplification is perhaps justified in this early 
stage of theoretical development. 
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Items of information needed to solve: 

. The company owns a total of 12 desks. 

. The company owns a total of 48 chairs. 

. The company owns a total of 12 type- 
writers. 

. The company owns a total of 15 filing 
cabinets. 

. One truck will carry 12 typewriters and 
nothing else. 

. One truck will carry 3 desks and nothing 
else 

. One truck will carry 5 filing cabinets and 
nothing else. 

8. One truck will carry 24 chairs and nothing 

else. 

Two other problems of the same type were em- 
ployed. One of these required the Ss to determine 
the time of arrival of a plane at a specified town, and 
the other required them to determine the amount of 
gasoline required to move vehicles from one motor pool 
to another. For each of these three problems, the 
statement of the problem was typed completely on each 
of four separate cards and each item of information 
was typed on a separate card. The materials for a 
single problem consisted of these twelve white cards. 

Subjects. The Ss were students enrolled in psy- 
chology or socivlogy courses at the University of 
Wisconsin. Ninety groups (sixty exclusively female and 
thirty exclusively male) were run. Each of these groups 
was assigned at random to one of six experimental 
conditions, subject to the restriction that five groups of 
males and ten groups of females be assigned to each 
experimental condition. The six experimental condi- 
tions represented the possible combinations of two types 
of information distribution and three patterns of com- 
munication. 

Procedure. Each group was required to solve the 
three problems mentioned above. Under conditions of 
equal distribution of information, for each problem 
each S was given one of the problem cards and two of 
the information cards. Under conditions of unequal 
distribution of information, the S in Position A was 
given one of the problem cards and five of the informa- 
tion cards, and each of the other Ss was given one of the 
problem cards and one of the information cards. The 
distribution of particular items among positions was 
randomized by shuffling the white cards before each 
session. The sequence in which the problems were 
presented was varied according to a systematic latin 
square such that each of the three problems was given 
first, second, and third an equal number of times, and 
each problem preceded and followed every other prob- 
lem an equal number of times. 

With the exception of communication net and distri- 
bution of information, the procedure was exactly the 
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WHEEL SLASH CIRCLE 


Fic. 1. DIAGRAM OF THE THREE COMMUNICATION NETS 
TO BE INVESTIGATED 


Marvin E. SHAW 


TABLE 1 
CHARACTERISTICS OF THE EXPERIMENTAL NETS 


| Individual | Maximum 
Sum of| Net Centralities Differ- 
Dis- |Central- - }, Saete 
tances | ity ~ |. egiviaal 
A|B c;D ities 
3 18 16.8 3.66.03.63.6 
14 16.4 |3.54.74.73.5 
16 16.0 |4.04.04.04.0 


same for all groups. Each of the four Ss was brought 
into the experimental room separately, seated accord- 
ing to a randomized schedule, and requested to remain 
quiet. The Ss were told the “names” of the members 
of the group (i.e., the color used to identify each mem- 
ber), the ones with whom each could communicate, 
the method of communication, and the general nature 
of the tasks. 

When all three problems had been completed, each 
S was asked to fill out a three-part questionnaire. 
Part 1 required sociometric choices of the member of 
the group most preferred for six different activities 
ranging from one almost identical to that just com- 
pleted to one almost completely unrelated to the ac- 
tivity just completed. Part 2 required each S to in- 
dicate on a nine-point scale how well he liked his job 
in the group, how he would compare his position with 
each of the other positions, how well he thought the 
group members cooperated with each other, and how 
he would rate the performance of the group. Part 3 
required an answer to the question, “Did your group 
develop a system ? If so, describe it briefly.” 


RESULTS 


The results of this study fall under four 
general headings: (a) time scores, (6) com- 
munication units, (c) errors, and (d) question- 
naire results. The first three were taken as 
indicators of efficiency; the questionnaire 
responses were used as indicators of morale 
and leadership. 

Time scores. Time to reach a solution can 
be measured in at least three ways: (a) the 
fastest single correct trial, (6) the average 
time per problem per position, and (c) the 
total elapsed time per problem. Analyses of 
variance applied to the results using the first 
two time measures revealed no significant 
differences. The means of total elapsed time 
per problem are presented in Table 2. Analysis 
of variance showed significant differences 
between trials (p < .001). This probably 
represents learning to use the net, since all 
the problems used were well within the ex- 
periences of the Ss. There were no over-all 
differences between nets or between distribu- 
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TABLE 2 


MEANS OF ToTAL ELAPSED TIME PER PROBLEM 
IN MINUTES 


Equal Distribution Unequal Distribution 
Trials 
Wheel) Slash | Circle! Wheel Slash Circle 


1 16.12¢ 16.2 
5 


2t 17.71¢ 16.29 16.57¢ 16.05¢ 
2 |11.68 10.98 | 9.30 11.57 11.31 10.10 
3 10.09 | 8.52* 8.26* 9.30 8.61* 7.77* 


* Separated as stragglers. 
t Separated as a homogeneous group. 


tions of information. However, the nets X 
trials X distribution interaction was significant 
(p < .01). 

In order to determine as nearly as possible 
just where the differences occurred, Tukey’s 
gap test (7) was applied to the mean scores.°® 
The results of this test are indicated in Table 
2 by the symbols placed beside the separated 
means. The scores for the first trial differed 
as a group from those for all other trials, 
indicating that most of the learning occurred 
during the first trial. On the last trial, the 
slash and the circle required less time to reach 
a solution than did the wheel, regardless of 
information distribution. However, a precise 
relationship between net centrality and time 
was not found. 

The mean time required by each position is 
shown in Table 3. Analyses of variance for 
each net taken separately yielded essentially 
the same results for all three nets. Distribution 
as a main effect was not significant, but posi- 
tions and the groups X positions interaction 
were both significant (p < .001) for all three 
nets. The results of the gap test are indicated 
by symbols. Under conditions of equal dis- 
tribution of information, only Position B in 
the wheel was separated. This means that 
only relatively large differences in individual 
centrality produce differences in time required 
to reach a solution. Under conditions of 
unequal distribution of information, the 
position receiving the most information 
(Position A) required significantly less time 
to reach a solution than did any other position. 

Communication units. Communication units 
were measured by counting the number of 
separate items transmitted by the Ss in a 


5In this test and in all subsequent applications of 
the gap test, the .05 level of confidence was accepted. 


TABLE 3 


MEAN Time REQUIRED BY Eacu PosITION 
IN MINUTES 


Positions 
Net “= 
B = D 
Wheel 
Equal 
Unequal 
Slash 
Equal . 
Unequal .56} 
Circle 
Equal 
Unequal 85} 


9.73° 
10.91 


11.14 
11.71 


11.06 
8.61} 


10.02 
10.17 


10.43 
11.35° 


.37 10.21 
10.13 


9.93 
10.05 


* Separated as stragglers. 
t Separated by a significant gap. 


TABLE 4 


MEAN NouMBER OF ITEMS REQUIRED 
SOLUTION BY Eacu Net 


TO REACH A 


Equal Distribution | Unequal Distribution 

Trials = ' nen - 
Wheel | Slash | Circle; Wheel Slash | Circle 

1 2.3" 
2 58.0 
3 44.9* 


74.9 7 
70.1 | 5 
62.5  o& 


6. 51.4* .6* 
». 48.4* 
). 43 .3* 


* Separated as stragglers 


given net or position, since it was possible for 
one card to contain several such items.* An 
item is here defined as any simple sentence, 
or any meaningful part of a complex or com- 
pound sentence. Table 4 presents the mean 
number of items transmitted by each net. 
Analysis of variance indicated that there were 
significant net differences (p < .001), trial 
differences (p < .001), and second-order 
interaction effects (p < .001). Examination 
of Table 4 reveals that the chief difference 
between nets was that the wheel required 
fewer items to reach a solution than did the 
slash and the circle. Generally, the number 
of items per trial decreased with practice in 
much the same way as did the time scores. 
The significant interaction is apparently due 
to the relatively large number of items trans- 
mitted by the wheel (equal distribution) on 
Trial 2, and by the slash (unequal distribu- 
tion) on Trial 1. For items transmitted, as 


* Results of an analysis of number of cards used 
were essentially the same as the results reported here. 
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TABLE 5 
MEAN NuMBER OF ITEMS COMMUNICATED PER 
PROBLEM FOR Eacu POsITION 
Positions 
Net 


B 


Wheel 
Equal 
Unequal 
Slash 
Equal 9 14. 
Unequal . 13. 
Circle 
Equal 15. 16.7 17.4 15 
Unequal 18. 15.4 16. 15. 


* Separated as stragglers 
+ Separated by a significant gap as a homogeneous group. 
t Separated by a significant gap 


for the time scores, net centrality was found 
to be an inadequate index. 

The mean number of communication items 
per problem for each position is shown in 
Table 5. Analyses of variance for each net 
taken separately showed that in the slash 
positions differed significantly (p < .001), and 
in the wheel positions and the distribution X 
positions interaction were significant (p < 
.001). Examination of the means in Table 
5 shows that in all nets the individual cen- 
trality indices were related to the number 
of items communicated, with those positions 
having the higher individual centrality send- 
ing more messages. In the wheel and the slash, 
the person receiving the most information 
sent more items than did the equivalent posi- 
tion under conditions of equal information 
distribution. Again, an inverse relationship 
was found between the time required to reach 
a solution and the number of items trans- 
mitted. 

Errors. Two types of errors were computed: 
(a) decision errors, defined as an incorrect 
answer reported by a member of the group, 
and (6) information errors, defined as the 
transmission of an item of information dif- 
ferent from that contained in, or derivable 
from, the items of information given to the 
group as a part of the problem to be solved. 
Since the error scores were decidedly skewed 
and the variance was heterogeneous, the 
Mann-Whitney U test (6) was used instead 
of the usual analysis of variance. Only the 


differences produced by the information dis- 
tribution variable were significant (p < .001). 
The mean number of decision errors was 2.6 
and 2.1 for equal and unequal distribution, 
respectively; the mean number of information 
errors was 9.8 and 6.2 for equal and unequal 
distribution, respectively. This means that 
unequal information distribution decreased 
the number of both decision and information 
errors. 

The corrective power of the nets as meas- 
ured by the ratio of decision errors to informa- 
tion errors is given in Table 6. Analysis of 
variance showed significant differences be- 
tween nets (p < .05), but information dis- 
tribution had no effect. The gap test showed 
that the wheel differed significantly from the 
other two nets; i.e., the wheel was less effec- 
tive in this respect, since a larger numerical 
value indicates less corrective power. This 
finding is in agreement with the findings with 
respect to time scores and number of items 
transmitted: the wheel differs from the other 
nets but there is no precise relationship be- 
tween corrective power and net centrality. 

Questionnaire results. The data obtained 
from the questions requiring sociometric 
choices of the S most preferred in various 
situations were analyzed for each net taken 
separately. In the wheel, 36 chi squares were 
computed to test this hypothesis and 28 of 
these were significant (p < .05). In the slash, 
19 chi squares were significant (p < .05) out 
of 24 computed. In the circle, only 4 chi squares 
were significant (p < .05) out of 48 computed. 
These results offer strong support for the 
hypothesis that Ss in the positions having the 
higher centralities are chosen more frequently 
than Ss in positions having lower centralities. 

Comparisons of choices under the two condi- 
tions of information distribution yielded only 


TABLE 6 
CorRRECTIVE PowER OF Nets: Ratio oF DECISION 
Errors TO INFORMATION Errors* 


Equa! Distribution Unequal Distribution 
Measure 7: 
Wheel Slash Circle Wheel § Slash (Circle 
Mean 51 .33 | .30 a7 34 25 
SD .59 .39 | .33 83 .37 | .41 
* Groups making no information errors were omitted when 
computing corrective power. 
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nonsignificant chi squares. The S in the posi- 
tion receiving the most information was 
chosen more frequently under conditions of 
unequal distribution of information than was 
the equivalent position under conditions of 
equal distribution of information in 16 of the 
18 comparisons made, but the differences were 
small and hence did not result in a significant 
change in the over-all distribution of choices. 

Examination of the sociometric choices 
revealed a similar pattern with respect to the 
number of times a single S was chosen unani- 
mously as the one most preferred. Under 
conditions of equal information distribution, 
the mean number of unanimous choices was 
6.5, 3.2, and 0.8 for the wheel, the slash, and 
the circle, respectively; under conditions of 
unequal information distribution, the cor- 
responding means were 6.0, 1.5, and 2.3. The 
nonparametric test developed by Friedman 
(3) and described by Wilcoxon (8, pp. 7-9) was 
used to analyze these data. Over-all differences 
were highly significant (p < .001), but when 
net and distribution effects were isolated, 
only the net differences were significant (p < 
.01). However, the net X distribution inter- 
action was significant (p < .01), indicating 
the effects of the net were different for dif- 
ferent distributions. 

Analyses of both the number of times 
chosen and the number of unanimous choices 
indicate that emergence of a leader, as indi- 
cated by sociometric choice, is directly related 
to the centrality index when only the equal 
information distribution situation is con- 
sidered. In the unequal distribution of in- 
formation situation, the number of unanimous 
choices was increased in the circle, decreased 
in the slash, and essentially unchanged in the 
wheel. This finding indicates that emergence 
of leadership is determined by the number of 
positions in the net having the highest in- 
dividual centrality (inverse relationship) and 
the maximum difference in individual centrali- 
ties (direct relationship). 

The mean ratings per net for the questions 
dealing with job satisfaction, positions com- 
parison, group cooperation, and group per- 
formance were analyzed by applying analysis 
of variance to each question taken separately. 
The results may be summarized as follows: 
(a) Ss in the wheel enjoyed their job in the 
group significantly less than did the Ss in 


the slash and in the circle (p < .001). (6) As 
a group, the Ss showed no significant differ- 
ences in their ratings of the relative value of 
positions in the net. (c) The ratings of group 
cooperation showed slight but significant net 
differences (p < .05); the Ss in the wheel 
rated the cooperation of their group lower 
than did the Ss in the slash who in turn rated 
the cooperation of their group lower than did 
the Ss in the circle. (d) The Ss in the wheel 
rated the performance of their group lower 
than did the Ss in the slash and the circle 
under all conditions; under conditions of equal 
distribution of information, the slash and the 
circle did not differ in this respect, but under 
conditions of unequal information distribution, 
the Ss in the slash rated the performance of 
their group higher than did the Ss in the 
circle (p < .05). 

Taken together, the rating scale results 
indicate that under conditions of equal in- 
formation distribution, centrality was not 
clearly related to group morale (where morale 
is defined as general satisfaction with the 
group’s activity). Unequal distribution of 
information affected morale in only one of 
the three nets; it seems safe to conclude that 
information distribution is not an effective 
variable as far as group morale is concerned. 
These results are in agreement with the find- 
ings with respect to time scores, communica- 
tion units, and corrective power in that the 
wheel differed from the other two nets and 
in that the information distribution variable 
had little effect upon the group scores. 

The rating scale data were also analyzed 
according to the ratings of individual posi- 
tions, with the following results: (a) There 
were no significant differences between the 
ratings of job satisfaction and group coopera- 
tion. (6) The ratings of various positions dif- 
fered significantly (p < .01) under all experi- 
mental conditions, and the distribution X 
positions interaction was significant (p < 
.05) for the wheel and the circle. The positions 
having higher centrality indices were rated 
higher than those positions having lower 
centrality indices under both conditions of 
information distribution and in all three 
nets. In the wheel and the circle, the position 
receiving most of the information was rated 
higher under conditions of unequal distribu- 
tion than was the equivalent position under 
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conditions of equal information distribution. 
(c) In the slash, all positions rated the group 
performance higher under conditions of un- 
equal information distribution than under 
conditions of equal information distribution 
(p < .001). 

These results lead to the conclusion that 
there is some relationship between individual 
centrality and individual morale, but this 
relationship is tied very closely to the position 
preference component. The information dis- 
tribution variable produced equivocal results: 
additional information either had no effect 
upon individual morale (as indicated by job 
satisfaction and group cooperation ratings) 
or increased the individual morale (as shown 
by the positions comparisons ratings in the 
wheel and the circle and the group performance 
ratings in the slash). 

Responses to the question concerning group 
organization revealed that all groups used one 
of two forms of organization: (a) all informa- 
tion was sent to one S who solved the problem 
and sent the answer out to the other Ss, or 
(6) all information was sent to all Ss and then 
each S solved the problem independently. The 
first method was used by only one group in 
the slash and by only one group in the circle, 
but was used by eleven groups in the wheel. 
These data do not support the earlier findings 
that high net centrality leads to more con- 
sistent group organization. 

DISCUSSION 

The data from this study make it clear that 
centrality correlates with behavioral measures 
only when differences among positions within 
the net are considered. Similarly, altering the 
distribution of information changes the be- 
havior of Ss in the various positions, but has 
little effect upon the performance of the group. 

Let us examine some of the positive rela- 
tionships among the variables in the present 
study. Subject to the limitations imposed by 
the conditions under which the data were 
collected, the results of this experiment offer 
some evidence for the following hypotheses: 

1. The time required for an individual in a 
position to complete an activity varies in- 
versely with individual centrality. 

2. Individual morale, number of items trans- 
mitted by an individual in a position, and the 
probability that an individual in a given 
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position will be chosen as a leader vary di- 
rectly with individual centrality. 

3. The emergence of a leader in a net (as 
indicated by unanimous choice) is determined 
by the number of positions having the highest 
individual centrality index in that net and 
the maximum difference in individual centrali- 
ties. 

4. Increasing or decreasing the amount of 
information initially assigned to an individual 
in a given position has an effect similar to 
respectively increasing or decreasing the in- 
dividual centrality index of that position. 

How can these relationships be explained? 
Leavitt believed that centrality was effec- 
tive primarily because it determines the de- 
gree of independence of the individual in a 
position. The present study offers no evidence 
against this explanation, but there is another 
possible explanation of the relationship be- 
tween individual centrality and morale that 
perhaps should be mentioned. This view would 
assume that centrality, because of its cor- 
relation with number of channels,’ determines 
the amount of communication activity en- 
gaged in by an individual and hence his 
enjoyment of the task. This draws some 
support from an ad hoc analysis of time per 
message which showed that Ss with higher 
centralities spent significantly less time per 
message than did Ss with lower individual cen- 
tralities (p < .001). The implicit assumption 
is that an individual enjoys a task which keeps 
him occupied more than he does a task 
which permits relatively long periods of inac- 
tivity. 

The reason for the relationship between 
individual centrality and number of items 
transmitted becomes obvious when it is re- 
membered that individual centrality and 
number of channels are perfectly correlated 
and that the most frequent method of solu- 
tion was for each S to send all available in- 
formation to all other Ss with whom he was 
connected. Thus it was inevitable that more 
channels would require more messages. 

It has been pointed out that the emergence 
of leadership in a net is determined by the 
number of positions having the highest 
centrality index in the net and the maximum 


7 Centrality and number of channels are perfectly 
correlated in the nets used in this experiment, but this 
is not true for all possible nets. 





INFORMATION AND PERFORMANCE IN COMMUNICATION NETS 553 


difference in individual centralities. The inter- 
pretation would seem to be that the less 
ambiguity associated with the distinction 
between the position having the highest 
individual centrality and the other positions, 
the more agreement there is among group 
members regarding which S is most desirable 
for the leadership role. 

The effect of the communication net upon 
corrective power requires some explanation. 
It appears that in the wheel the only ways in 
which an error could be detected would be 
for an S to detect his own error, for an S to 
pass on to another S a set of computations 
containing an error, or for an S to pass two 
conflicting statements to the same S. In the 
slash and in the circle an additional oppor- 
tunity for error detection was possible: an 
S might receive information about the same 
objects or events from two or more different 
sources. This added opportunity for error 
detection and correction probably accounts 
for the superiority of the slash and the circle 
in this respect. 

In summary, it is believed that individual 
centrality determines the behavior of in- 
dividuals in various positions within nets via 
the limitations which it imposes on these 
individuals. Centrality, however, bears no 
simple relationship to independence of action. 
The underlying processes which determine 
behavior vary with the particular aspect of 
behavior which is being measured. 


SUMMARY 

The purposes of the present experiment 
were (a) to test the effects of unequal dis- 
tribution of information upon leadership, 
morale, and problem-solution efficiency in 
various communication nets, and (6) to check 
the findings of previous investigators, using 
a set of slightly more complex problems and 
different communication nets. It was believed 
that the primary source of differential forces 
would be centrality, and that increasing 
or decreasing the amount of information 
initially assigned to a given position would be 
equivalent to respectively increasing or de- 
creasing the centrality index of that position. 

Fifteen groups were randomly assigned to 
each of six experimental conditions, repre- 


senting the six possible combinations of two 
types of information distribution (equal and 
unequal) and three communication nets (the 
wheel, the slash, and the circle). Each group 
was required to solve three eight-item prob- 
lems. 

The results were interpreted as supplying 
some evidence for the following hypotheses: 

1. Time required for an individual in a 
position to complete an activity varies in- 
versely with individual centrality. 

2. Individual morale, number of items 
transmitted by an individual in a position, 
and the probability that an individual in that 
position will be chosen as a leader vary di- 
rectly with individual centrality. 

3. The emergence of leadership in a net 
varies inversely with the number of positions 
having the highest individual centrality index 
in that net and directly with the maximum 
difference in individual centralities. 

4. Increasing or decreasing the amount of 
information initially available to an individual 
in a given position has an effect similar to 
respectively increasing or decreasing the 
individual centrality index of that position 
or the number of channels available to that 
position. 
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SOME EFFECTS OF UNEQUAL DISTRIBUTION OF INFORMATION 
IN A WHEEL GROUP STRUCTURE! 
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University of Wisconsin 


N THE immediately preceding report, 

Shaw (1) investigated some effects of 

unequal! distribution of problem-related 
information in group structures. In general, 
he found that information distribution within 
structures had no effect upon over-all struc- 
ture measures of time, message units, errors, 
or morale although decreasing or increasing 
the amount of information initially available 
to an individual within a structure had an 
effect similar to respectively decreasing or 
increasing the individual centrality index of 
that position. 

There are, however, other distributions 
which affect the generality of this finding. In 
any structure having central person(s) in- 
formation may be distributed so that a 
peripheral person receives more or so that a 
central person(s) receives more. Shaw’s ex- 


periment tested only the former possibility: 
the present experiment tests both possibilities 
using the four-person wheel structure. 


PROCEDURE 

The apparatus, problems, questionnaires, and pro- 
cedures were identical to those of Shaw. 

The Ss were students enrolled in psychology and/or 
sociology courses at the University of Wisconsin. 
Forty-five groups of four members each were run. 
Thirty of these groups were composed of females and 
fifteen were composed of males. Each of these groups 
was randomly assigned to one of three experimental 
conditions, subject to the restriction that each experi- 
mental condition be assigned ten groups of females and 
five groups of maies. The three experimental conditions 
were: (a) equal distribution, (6) unequal distribution 
to the periphery, and (c) unequal distribution to the 
center 

Each group was required to solve three problems 
randomly assigned one at a time. Under the condition 
of equal information distribution, for each problem 
each S was given a card stating the problem and two 
cards giving information required in the problem’s 
solution. Under conditions of unequal distribution, 
the S in the appropriate position (A in the peripheral 
condition and B in the central condition. See the 
“wheel” in Fig. 1 of Shaw’s report) was given one of 
the problem cards and five of the information cards, 





1 This research was supported in whole or in part 
by the United States Air Force under Contract AF 
18(600)-340, monitored by the Human Resources 
Research Institute, Maxwell Air Force Base, Alabama. 


and each of the other Ss was given one of the problem 
cards and one of the information cards. 


RESULTS 


A simple within and between analysis of 
variance performed upon the times for the 
single fastest correct trial yielded an insignifi- 
cant value of F. 

Analyses of variance were performed upon 
the times of the fastest position per trial and 
slowest position per trial. Distribution was 
found to have no effect, either main or inter- 
action. The significant effects in both analyses 
were trials and groups within. The former 
indicated that learning was occurring; the 
latter that the sets of four persons differed 
from each other. (The groups within term is 
equivalent to an individual differences term 
in the usual psychological design.) 

A further breakdown of time scores involves 
positions within the structures. The time 
measures here for the main effects of distribu- 
tion and trials and their interactions differ 
from the previous two analyses in that here 
the design results in mean time appearing as 
means for those variables, i.e., the mean of the 
time of completion of the four positions’ times. 
Trials are again significant, and show a learn- 
ing effect. Positions and distribution X posi- 
tions are also significant in addition to in- 
dividual differences terms. Because position 
when tested by its interaction with distribution 
is not significant, only the interaction means 
are given in Table 1. Note particulariy that 
the central position is affected adversely as to 
time under unequal central distribution. 

The measure of communication was the 
message unit defined as a simple sentence or a 
meaningful part of a compound or complex 
sentence. The significant terms of the analysis 
of variance are discussed below. Trials show, 
once again, a learning effect, i.e., fewer message 
units per trial. Position, remaining significant 
over and above the distribution X position 
interaction, shows that the central person 
sent 3.3 times as many message units as the 
other positions. Table 2 summarizes the means 
of the distribution X position interaction. 


554 





UNEQUAL DISTRIBUTION OF INFORMATION IN A WHEEL STRUCTURE 


TABLE 1 


MEANS AND RESULTS OF TuKEY’s Gap Test (2) OF 
Time Scores Broken Down By POSITIONS 


Information Distribution 
Position z 
Unequal 
Central 


— Unequal 
— Peripheral 
9.05* 
10.91 
11.13 


11.14 
10.54 
11.37 


A 11.04 
B 9.74* 
c&D 11.18 


* Separated as a straggler (p 


TABLE 2 
MEANS AND RESULTs OF TuKEY’s Gap TEST OF THE 
DISTRIBUTION X PosiT1iON INTERACTION OF 
NuMBER OF MESSAGE UNITS TRANSMITTED 


Information Distribution 
Position aT -. . 
Unequal Unequal 
Peripheral Central 
A 9.69 12.24f 7.31 
B 29 .60* 27 .0O*t 31.82* 
C&D 10.11 7.88 7.d8 


* Separated by a significant gap 


Equal 


~ < .0S) as a homogencous 
group 
t Separated as a straggler (p < .05). 


Errors were analyzed as in Shaw’s experi- 
ment. No significant differences were found on 
any of the error measures. 

Analysis of the sociometric questions using 
chi square demonstrates that information dis- 
tribution did not change the pattern of soci- 
ometric choices on the six questions. Of the 24 
chi squares computed between distributions, 
only one was significant. These chi squares 
were computed for (a) the A position choosing, 
(6) the B position choosing, (c) the C and D 
positions combined choosing, and (d) all 
positions combined choosing for each ques- 
tion. 

On the other hand, chi squares computed 
within information distribution groups show 
that the choices reflect the group structure: 
Positions A and C plus D chose B (p < .05 for 
25 of 36 chi squares). Position B chose ran- 
domly among A, C, and D. 

The rating-scale data were subjected to 
analyses of variance. It was found that posi- 
tion was effective on three of the four ques- 
tions. In each instance, the B position yields 
the high mean, i.e., is more satisfied with the 
job, considers its status higher, and believes 
the group cooperated better than the other 


TABLE 3 
MEANS OF DisTRIBUTION X PosiITION INTERACTION 
ON JoB SATISFACTION AND POSITION 
Status RaTINGs* 


Job Satisfaction Position Status 


Position Un- Un 
sa) equal equal 

Equal Periph-| Cen- 

eral tral 


J Lies 
Equal ery Unequal 
jua eriph 

eral 


Central 


12.9 
16.2 
9.0 


10.8 
18.9 
10.3 


B 5 a7. 
C&D : 4 9; i 


A q 6.5; §.2 10.4 
5 
7 


* Tukey’s gap test yields no more information in this instance 
than does the analysis of variance 


three positions. Of most interest, however, are 
the significant interactions of distribution X 
position on the first two questions. The means 
of these interactions are given in Table 3. Note 
that every rating change follows the connected 
propositions that individual morale is directly 
related to individual centrality and that in- 
creased or decreased information status is 
similar to respectively increased or decreased 
individual centrality. 

Categorizing the answers to a question re- 
garding the groups’ development of a system 
showed no differences between the information 
distribution conditions when tested by chi 
square. Sixty-seven per cent of the groups, 
however, adopted the system of sending all 
information to everyone and checking each 
other’s answers. In the remaining 28 per cent 
which developed a system, the central posi- 
tion, B, received all information and trans- 
mitted only answers. Two groups claimed that 
no system was developed. 


DISCUSSION 


The data show that individual centrality 
is (a) directly proportional to individual 
morale and to the probability that an in- 
dividual will be chosen as a leader and (6) 
inversely proportional to individual problem- 
solution times where there is equal distribu- 
tion of information, but that there are devia- 
tions under other information-distribution 
conditions. Since the argument will be based 
primarily upon individual-position time scores, 
they are discussed first. 

Table 1, it will be recalled, presented the 
means of the position X information-distribu- 
tion time scores. Shaw’s formulation that 
increasing or decreasing information at a 
position is similar to respectively increasing 
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or decreasing the centrality index of that 
position is upheld there as long as compari- 
sons are made within distribution conditions.” 
Once comparisons are made between distribu- 
tions, however, failure to follow Shaw’s propo- 
sition is apparent. The most noticeable failure 
is at position B in the unequal central distribu- 
tion which should be the lowest mean and, 
instead, is the third lowest. This negative 
instance must be accounted for, and a probable 
answer is found in the communication require- 
ments placed upon B by the nature of the 
information distribution. 

The groups usually adopted the system of 
“information to everyone.”’ Under this sys- 
tem, B must send a minimum of 17, 18, and 21 
message units in the unequal peripheral, 
equal, and unequal central conditions re- 
spectively. As seen in Table 2, the message 
units actually sent by B order directly in the 
sequence given by these minima.* Times of 
the B position, though, order equal, unequal 
central, unequal peripheral. These orderings 
suggest that there is an interaction between 
individual centrality and communication re- 
quirements: that, when the number of required 
message units passes a certain optimal output 
level, it begins to counteract the effects of 
individual centrality. Such a statement may 
best be summarized with the term “‘satura- 
tion’ since it is not simply and directly re- 


2 There is some question as to the comparative time 
of positions A and B in the unequal peripheral condi- 
tion. Should this distribution so readily overcome the 
structural centrality effect as to produce the time dif- 
ferences between those positions that it did? The dif- 
ference between these means is associated with a 
p< .05. 

*The differences between the means are almost 
identical with the differences between the minima. 
This suggests, since the absolute values of the means 
are higher than those of the minima, that the channels 
carry a constant amount of “noisy” and/or redundant 
messages. Examination of Shaw’s data for three nets 
shows the same effect appearing there. 

* Two kinds of what we have called saturation may 
be distinguished: one we call “channel saturation” 
and refers to the number of channels with which a 
position must deal; the other, discussed above, is 
“message unit saturation.” The former should be 
closely related to both centrality and group size and the 
latter to centrality, problem size, and information dis- 
tribution. Each of the two main classes may break 
down into “input” and “output” saturations. If these 
variables are basic ones and the above textual state- 
ments are correct, they should ultimately affect leader- 
ship emergence and morale. Experiments now in 
progress bear upon these points. 
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lated to the number of message units required 
to be transmitted as is shown by the A posi- 
tion in the unequal peripheral distribution. 
Thus, information distribution is indirectly an 
effective variable by virtue of its effects upon 
what is, perhaps, a more basic phenomenon, 
saturation, or the input and output require- 
ments which it places upon positions within a 
group structure. 

One other comment is pertinent. Although 
morale follows almost perfectly Shaw’s 
original statement as to the effects of informa- 
tion distribution, the emergence of leader- 
ship does not: information distribution did 
not affect leadership emergence. Examination 
of Shaw’s data indicates that that conclusion 
was based upon the findings in structures 
where each position has two or more channels 
terminating upon it. Information distribu- 
tion, then, has an effect upon leadership 
emergence in structures having lesser dif- 
ferences in centrality than does the wheel 
structure. 


SUMMARY AND CONCLUSIONS 


An experiment was conducted using a four- 
person wheel group structure to test the effect 
of information distribution upon problem 
solving, leadership, and morale within the 
structure. Three conditions of information 
distribution were used: equal, unequal with 
most information initially located at the 
center of the structure, and unequal with a 
peripheral position initially receiving the 
most information. 

The data substantiate the position that 
the individual centrality index is directly re- 
lated to the emergence of leadership in the 
structure and individual morale. The findings 
with respect to information distribution, how- 
ever, suggest that it is not simply and di- 
rectly related to the individual centrality 
index. Rather, it appears to be only indirectly 
related thereto and mediated by a more basic 
phenomenon which has been termed “‘satura- 
tion.” 
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THE EFFECT OF ONE PARTNER’S SUCCESS IN A RELEVANT 
TASK ON THE INTERACTION OF OBSERVER PAIRS! 
BERNARD MAUSNER 


University of Massachusetts 


HE PRESENT report is based on one of a 

series of studies of the determinants of 

social interaction in judgment. The 
paradigm for these investigations is that of 
the Sherif experiment in which it was demon- 
strated that the judgments of subjects (Ss) 
reporting the extent of autokinetic effect 
converged toward a group norm. It has al- 
ready been found that the degree of this con- 
vergence can be affected by such factors as 
prior reinforcement of the responses (2, 4, 7), 
variation in the psychophysical method em- 
ployed (1, 5), variation in the nature of the 
stimuli (3), and manipulation of the prestige 
of one of the partners through his identifica- 
tion as an “‘expert”’ or “nonexpert”’ (6). With 
reference to the last of these factors, there 
exists the problem of specifying in as precise 
a manner as possible the variables determining 
prestige. When this is done, prestige may serve 
as a useful intervening variable in the theoret- 
ical treatment of social interaction. 

The writer (8) has suggested that con- 
vergence in judgment is a function of the 
summation of two conflicting response tenden- 
cies: (a2) a tendency to continue to make 
previous judgments, and (5) a tendency to 
agree with a partner. The partner’s prestige 
is probably one of the determinants of the 
latter tendency. Prestige may be strengthened 
where the partner gives specific evidence of 
competence in the kind of judgment being 
made; it may be weakened where evidence of 
incompetence is presented. Other things being 
equal, this variation in prestige should result 
in a concurrent strengthening or weakening 
of the tendency to agree with the partner. 
Where the opposing tendency, that to main- 
tain an earlier range of judgments, is of known 


1 The present report is the fourth in a series entitled, 
“Studies in Social Interaction.”” The data were pre- 
sented at the 1953 meetings of the Eastern Psycholog- 
ical Association. The experiment was carried out in the 
laboratory of the department of psychology, Wash- 
ington Square College, New York University. The 
assistance of the following students who served both as 
confederates and as experimenters is gratefully ac- 
knowledged: Leon Dreier, Peter Sciaki, Marilyn 
Reisman, Sandra Skolnick, and Barbara Diamond. 


strength, or is judged to be randomized in a 
group of Ss, convergence should be predictable 
from information concerning the partner’s 
demonstrations of competence. 

On the basis of these considerations a 
specific hypothesis was tested: Ss will show a 
greater tendency to converge in judgment toward 
partners who have previously demonstrated 
success in a related task than toward partners 
who have previously demonstrated failure. 


METHOD 


Subjects. The Ss were 28 undergraduate students in 
elementary psychology classes at Washington Square 
College, New York University. All were naive with 
respect to the purposes of the experiment and the area 
of the investigation. 

Procedure. In a preliminary session each S$ judged 
the length, in inches, of each of a series of 100 lines. 
These consisted of ten lines varying in length from 10 
to 19 in., each presented ten times. Each block of ten 
trials contained all ten lengths arranged in a different 
permutation from that used in other blocks. The 
judgments were made in a dark room; the lines were 
presented by means of an apparatus in which a lighted 
slot }4 in. wide and varying in length was exposed to S 
for approximately 1 sec. (for complete description cf. 4). 

Following this session (the Alone session), a con- 
federate of E was presented as a second “subject.” 
Both Ss were given 20 trials in which they were in- 
structed to move a stylus through a 17-cm. slot placed 
behind a curtain. They then attempted to reproduce 
this length by moving the stylus through an equivalent 
distance in a free slot, also hidden. The actual S$ com- 
pleted the task first and was followed by the confed- 
erate. Each S was requested to observe and record his 
partner’s performance. A success was defined as re- 
production within 1 cm. of the correct distance. The 
confederates were instructed to perform very skillfully 
for some Ss, very unskillfully for the remainder. Those 
Ss for whom the partner’s reproductions had an SD 
of 1.5 cm. or better were considered to have had a 
“successful” partner (group S); those for whom the 
SD of the reproductions was more than 1.5 cm., an 
“unsuccessful” partner (group U). The Ss were as- 
signed to these groups in rotation. Group S contained 
17 Ss, group U, 11 Ss. 

After this procedure had been completed, both Ss 
judged the lengths of the same series of lines previously 
presented (Together series). They alternated as first 
to respond to each stimulus. The confederates had 
been instructed by E to make their judgments vary 
consistently by 4 to 5 in. from those given by the actual 
S in the alone situation. 

Treatment of results. Since one of the two persons in 
each pair was E’s confederate, and, by instruction, was 
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forced to maintain a relatively constant range of judg- 
ments, the degree of social interaction in judgment was 
estimated by measuring the degree of shift in judgments 
made by the other member of the pair, the actual S, 
in two ways. First, a D score was calculated for each S. 
This measure is derived from the difference in frequency 
with which each number response was made in the alone 
and together situations. A high D score indicates con- 
siderable shift, and, where this is toward the partner, 
considerable convergence. A low D score indicates 
maintenance of the same range of judgments in the 
together as in the alone session (see 4, 6, for detailed 
method of calculation). In the second method of analy- 
sis, graphs of median judgments per block of ten 
trials were used to show direction of shift and as a 
basis for qualitative descriptions of the progress of 
interaction through the course of the experiment. 


RESULTS AND DISCUSSION 


Table 1 gives D scores for the Ss of both 
groups. The mean D score for group S is 
62.1, for group U, 20.2. This difference is 
significant at the .01 level (¢ = 3.10, m = 26, 
p < .01). This indicates greater shift in judg- 
ment from alone to together situations for 
the Ss whose partners were successful (group 
S) than for those whose partners were unsuc- 
cessful (group U). Inspection of the median 
graphs shows a tendency to formation of a 
social norm of judgment in the pairs of group 
S, a lack of such tendency in the pairs of 
group U. Figure 1 shows three such plots for 
Ss in group U; Fig. 2 four plots for Ss in 
group S. These are representative of the 10 
Ss in group U and the 12 in group S whose 
behavior corresponded with the prediction.? 
Thus the hypothesis presented above is sub- 
stantiated by the data of this experiment. 

Six of the 28 Ss do not show the predicted 
behavior. Four of the Ss in group S have D 
scores below the mean for group U; one (SWI) 
is borderline. These are clearly distinguished 
from the remainder of the Ss in this group 
both by their D scores, and by the failure of 
their graphs to show formation of a social 
norm. The Ss’ own performance on the in- 
terpolated task was not at all related to the 
degree of social interaction. Among the five Ss 
in group S who failed to converge none was 
particularly accurate in “blind” slot reproduc- 


?Graphs of median judgments per block of ten 
trials for the remainder of the Ss have been deposited 
with the American Documentation Institute. Order 
Document No. 4113 from the ADI Auxiliary Publica- 
tions Project, Photoduplication Service, Library of 
Congress, Washington 25, D. C., remitting in ad- 
vance $1.25 for 35 mm. microfilm, or $1.25 for 6 x 8 
in. photocopies. Make checks payable to Chief, Photo- 
duplication Service, Library of Congress. 
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TABLE 1 


INTERACTION BETWEEN S AND E’s CONFEDERATE 
(P) Grven By D Score; Accuracy oF SLoT REPRO- 
DUCTION BY SD or Trias; Group S: PARTNER’S 
SD < 1.50, Group U: PartNner’s SD > 1.50 


Group S 


Slot Reproduction 


SD(S) SD(P) 
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Group U 





SPO 

THE 

AUE 

BAS 

CAL 

FOT 

MER 10 
MAS 16 
jus 31 
ECK 32 
HOL* 66 





tion. The two most accurate Ss have the 
highest and the third highest D scores. The 
most inaccurate S in the entire experimental 
group (FOT) is found in group U, and has an 
appropriately low D score (12); i.e., he failed 
to converge. 

Unfortunately, there is no further informa- 
tion available in the data of this experiment to 
throw light upon the behavior of these deviant 
Ss. However, their occurrence may be pre- 
dicted from the theoretical scheme alluded to 
above. In any group of Ss chosen at random it 
is likely that, owing to personality factors 
or other variables not immediately identifiable, 
some will have a high tendency to maintain 
previous judgments. The opposing tendency 
to agree with the partner is probably not too 
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Fic. 1. MEDIAN JUDGMENTS PER BLOCK OF TEN TRIALS First ALONE AND THEN WITH A PARTNER 
UNSUCCESSFUL IN SLOT REPRODUCTION. PLOTS FOR THREE Ss ARE INCLUDED. 


high to begin with where the partner is an 
anonymous fellow student. It would be un- 
likely, therefore, that a manipulation of the 
partner’s prestige, such as that here attempted, 
would be effective 100 per cent of the time. 
The fact that it was effective in 71 per cent 
of the cases in group S, and that a significant 
difference in D scores was found between the 
two groups, argues for the sensitivity of the 
convergence experiment as a measure of social 
interaction. 

Apparently the effect of a partner’s failure 
in lowering the tendency to agree is even more 
pronounced than the effect of his success in 
raising it, since all but one of the Ss in group 
U failed to converge. That deviant case 
(HOL) had a D score (66) actually above the 
mean for group S. Although there is no cor- 
roborating evidence, it is possible that the 
fact that HOL is the sole Negro S is significant. 
There may be factors in the developmental 
history of Negro Ss which would lead to a 
heightened tendency to agree with a white 
partner. 


Thus two kinds of conditions antecedent to 
prestige can now be identified. Prestige as 
measured in terms of convergence in judgment 
has been anchored to a partner’s group mem- 
bership (7) and may now also be anchored to 
a partner’s demonstrations of competence in 
relevant tasks. The next step in these investi- 
gations should be an attempt to identify and 
measure some of the genetic factors which 
relate prestige to tendency to agree. It may be 
suggested that, for most people, agreement 
with prestigeful individuals has been posi- 
tively reinforced, and disagreement negatively 
reinforced. That is, most Ss used in these 
experiments have received reinforcements 
from agreeing with members of an “expert” 
group, or with people who demonstrate 
competence. This argument leads to a testable 
hypothesis. It may be predicted that Ss will 
converge toward partners with whom agree- 
ment has previously been positively reinforced 
in some other context, but will not converge 
toward partners with whom agreement has 
been negatively reinforced. If such a relation- 
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ship were found to hold, the kinds of prestige S to agree with him, and thus finally of the 
effects demonstrated in the present and earlier degree of convergence in judgment. 
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SOME PERSONALITY CORRELATES OF STIMULUS 
GENERALIZATION UNDER STRESS 


CHARLES W. ERIKSEN 
The Johns Hopkins University 


N THEIR recent review of research on 

psychological stress, Lazarus, Deese, and 

Osler (11) have pointed out the impor- 
tance of the individual differences component 
in performance under stress conditions. In 
some subjects (Ss) the introduction of stress 
results in poorer performance, but in others 
performance is unaffected or even improves. 
The purpose of the present experiment was to 
investigate some of the personality correlates 
of these individual differences in response to 
stress. Specifically, the problem here was the 
relation of the hysteria-psychasthenia dimen- 
sion, as measured by the respective scales in 
the Minnesota Multiphasic Personality In- 
ventory (MMPI), to the degree of stimulus 
generalization shown under the stress of 


strong electric shocks. The Ss were trained to 
make a simple horizontal arm moveinent when 
a square of a certain size was presented, and 


the amount of generalization of this response 
to squares of other sizes was determined. 

Previous research has indicated that learn- 
ing and memory performance under stress 
produced by “self-esteem” threat is related 
to the hysteria-psychasthenia variable. The Ss 
scoring high on hysteria have been found to 
have poorer recall of failures on self-esteem- 
involving tasks (6, 15) and poorer memory for 
anxiety-arousing stimuli (1) than Ss scoring 
high on psychasthenia. However, in the 
present experiment not only was a different 
aspect of performance measured, but the 
method of inducing stress was the threat of 
painful physical stimulation rather than a 
threat to the Ss’ self-esteem. To see what 
relationship, if any, might be expected between 
performance and the hysteria-psychasthenia 
variable under these changed conditions, we 
need to consider briefly some of the factors 
involved in the concept of psychological 
stress. 

We can equate psychological stress with 
anxiety and analyze it into the following 
sequential components: 


Stimulus situation — anxiety response — S’s 
response to his internal anxiety-produced 
stimuli 


Such an analysis helps us to bear in mind some 
of the factors involved in stress and to see what 
effect changes in the stress conditions will 
have upon different performance measures. 

The stimulus situation alone provides ample 
opportunities for individual differences to 
occur in stres¢ research. It has been shown 
(12) that differences in goals and prior ex- 
periences may render a given stimulus anxiety 
arousing for some Ss and not for others. More 
pertinent to our present problem is how dif- 
ferences in the stimulus situation limit or 
affect S’s response to his internal anxiety cues. 
Responses are made not only to the internal 
anxiety cues, but they must also be adapted 
to the external anxiety-provoking situation. 

Although a number of different stimulus 
situations have been used to stress Ss, most, 
if not all, represent variations upon one or the 
other of two basic types. One is to threaten 
Ss with painful physical stimulation such as 
electric shocks. The other is to threaten self- 
esteem by failure or the threat of failure on 
some highly competitive task. Both types of 
situation and their variations have been suc- 
cessful in making Ss anxious, but it is to be 
noted that a given response may not be equally 
effective to both types of threat. People differ 
in the defenses they use and in the way they 
have learned to handle and interpret anxiety. 
We may expect that some of these learned 
responses are more appropriate to one type 
of stress than to the other. Avoidance defenses 
such as forgetting failures or “psychologically 
leaving the field’ may be effective methods of 
evading anxiety under situations of self-esteem 
threat, but they are quite useless when it 
comes to evading electric shocks or other 
physical punishment. 

In the present experiment, where anxiety 
was induced by electric shocks, the prediction 
was made that Ss scoring high on the hysteria 
scale would show more stimulus generaliza- 
tion than Ss scoring high on psychasthenia. 
This prediction was arrived at from two con- 
siderations. First, Ss high on hysteria, de- 
prived of using their avoidance-type of defense 
by the nature of the present stress situation, 





562 


should be more subject to uncontrolled anx- 
iety. The Ss high on psychasthenia, on the 
other hand, would appear from previous re- 
search and from the content of the MMPI 
items to be more accustomed to coming to 
grips with anxiety on the conscious level. 
Their supposed defenses of rationalization and 
intellectualization should be more compatible 
with the present stress situation. 

The second consideration was that the 
greater the uncontrolled anxiety, the greater 
the stimulus generalization. Meyer (13) has 
recently summarized evidence indicating that 
one effect of anxiety is to lower the threshold 
for responses, and experiments by Hilgard, 
Jones, and Kaplan (10) and by Rosenbaum 
(14) have suggested a relation between anxiety 
and stimulus generalization. 


METHOD 


Design. The Ss were trained to respond with a hori- 
zontal arm movement when a square of a certain size 
(training stimulus) was presented. After this response 
had been well established, squares of different sizes 
(generalization stimuli) were occasionally presented to 
Ss instead of the training stimulus, and Ss’ responses 
or tendencies toward responses were recorded. Two 
experimental conditions were used. Under the avoidance 
condition, Ss were instructed that on some of the trials 
when the training stimulus appeared a shock circuit 
would be on. On these trials they would receive a shock 
if their response to the training stimulus was not rapid 
enough. Under the monavoidance condition, Ss were 
also told that some of the time when the training stim- 
ulus appeared a shock circuit would be on, but here 
they were given to understand that the shocks were 
completely unrelated to their performance. In other 
words, no matter what they did, when the shock circuit 
was on, they would be shocked. 

These two experimental conditions were used to 
determine what effect the factor of evitability of shock 
might have upon performance and also as a control 
for Ss who might deliberately decide to respond to every 
stimulus in order better to avoid receiving shocks. As a 
further control against this latter possibility, Ss in 
both conditions were told that, although there was no 
shock for incorrect responses, the experimental session 
would continue until their performance satisfied a 
criterion of speed and accuracy.! 

The stimuli were three squares having a projected 


' Although the above procedure is quite similar to 
that used by Rosenbaum (14), it differs from his in two 
important respects. Rosenbaum failed to provide a 
control against Ss who may deliberately decide to 
respond to every stimulus, and he also did not inform 
Ss that the shock circuit would not always be on when 
the training stimulus appeared. An attempt by the 
present experimenter to duplicate exactly Rosenbaum’s 
procedure revealed that a number of Ss deliberately 
responded to every stimulus since they “had nothing 
to lose’”’ because errors were not punished. Some Ss also 
discovered that the shocks were arbitrarily adminis- 
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size of .75, 1.12, and 1.50 in. square. These sizes were 
found to be equally spaced on the equal discrimin- 
ability scale described by Garner and Hake (8). Be- 
tween Ss the large and small square were alternated 
as the training stimulus. The Ss were run individually. 
Before starting the experiments, S was shown the three 
stimuli, one at a time, to insure that he could correctly 
identify each. Twenty training trials were then given 
in which the training stimulus was the only stimulus 
presented. Five test trials for the training stimulus and 
for each of the two generalization stimuli followed the 
training trials without interruption. An additional 42 
training trials were interspersed among the test trials 
in such a manner that each test trial was separated 
from the other test trials by three training trials. Thus 
for an S who was trained on the large square, the ex- 
perimental sequence would be: 20 successive presenta- 
tions of the large square, a test presentation of the large, 
medium, or small square, three more presentations of 
the large square, and so on. Two counterbalanced 
sequences were employed with the stimuli in the test 
trials to minimize sequence effects in the generalization 
gradients. 

Subjects. A shortened version of the MMPI, con- 
taining the items for the hysteria, psychasthenia, lie, 
and K scales as well as the items for Taylor’s scale of 
manifest anxiety (17), was administered to 120 males 
enrolled in the introductory course in psychology. 
The Ss used in the experiment were selected from 
among those who scored above the 80th decile 
(hysterics) and below the 20th decile (psychasthenics) 
on the distribution of scores obtained by subtracting 
each individual’s score on the psychasthenia scale from 
his score on the hysteria scale. This subtractive pro- 
cedure was used since it had been previously found to 
be effective (1, 6) as a way to partial out a general 
factor of “neuroticism” which contributes to high 
scores on both scales. A total of 18 hysterics, 12 in the 
avoidance condition and 6 in the nonavoidance, and 
21 psychasthenics, divided 14 and 7 between the 
avoidance and nonavoidance conditions, constituted 
the experimental Ss. 

Apparatus. The S was seated at a table 2 ft. away 
from a 3 X 4ft. vertical plywood partition. In the 
lower left side of the partition was a horizontal slot 
16 in. wide and 11 in. long. In the right-hand side of 
the partition, at eye level, was an 8 X 8-in. ground 
glass window. The stimuli were presented to S by 
projecting them on the window. On a ready signal 
from E, S inserted a pencil into the slot on the left- 
hand edge where the pencil made contact with a 
microswitch. When the training stimulus appeared, S 
made a rapid horizontal movement of the pencil to 
the right, all the while keeping the pencil in the slot. 
The extent of the required movement was 23 cm. 
(approximately 9 in.), which was indicated by a 
marker on S’s side of the partition. The S’s response 
was recorded by the tracing of the pencil on recording 
paper mounted behind the partition. The stimuli were 
projected by a 35-mm. projector. Through the use of a 
condenser-time-delay circuit the duration of exposure 
of each stimulus was 3 sec. 





tered by E since, in accord with the arbitrary schedule, 
they would sometimes be shocked when they had made 
a very rapid response and not shocked when their 
response was unusually slow. 
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The latency of S’s response was measured to the 
nearest hundredth of a second by a Standard Electric 
Timer. The timer was activated simultaneously with 
the projector and was stopped when S broke contact 
with the microswitch at the left-hand edge of the slot 
If S failed to respond, the timer automatically stopped 
when the projector circuit was interrupted by the 
timing-delay apparatus. 

Electric shock was administered to the back of S’s 
left hand by hand electrodes. The source of shock was a 
faradic current from an inductorium supplied by a 
6-v. dry cell. Strength of shock varied slightly between 
Ss, since the attempt was made to keep the strength 
of shock just below the level where S would refuse to 
continue with the experiment. 


RESULTS 

For each S the mean response amplitude on 
the test trials was computed for each of the 
three stimuli. The means of the individual S 
means are plotted in Fig. 1, by experimental 
condition, for the hysteric and psychasthenic 
groups. While there is essentially no dif- 
ference in response amplitude between groups 
or conditions to the training stimulus (0), the 
amount of generalization to the two non- 
training stimuli (stimulus 1 and 2) is con- 
sistently greater in the hysteric group under 
both experimental conditions. Also both the 
hysteric and psychasthenic groups show more 
generalization under the avoidance than under 
the nonavoidance condition. There is little 
difference, however, in the amount of generali- 
zation shown to stimulus 1 and 2. 

Statistical analysis supports the graphical 
interpretation of the data. Since the variance 
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TABLE 1 


SUMMARY OF THE ANALYSES OF VARIANCE FOR THE 
AVOIDANCE AND NONAVOIDANCE CONDITIONS 


. Mez 5 
Source . fean df s? 
Square ~ 


Avoidance Condition 


Groups 280.72 72 4.76** 

Stimuli 26.90 90 2.15 

Individuals 1415.32 24 58.97 

Groups X Stimuli inter- 
action 

Pooled Ss X Stimuli in- 
teraction 


Total 2024.35. 51 
1 
1 


300.64 24 
Nonavoidance Condition 


Total 

Groups 

Stimuli 

Individuals 

Groups X Stimuli inter- 
action 

Pooled Ss X Stimuli in- 


teraction 42.70 11 a. 





* Significant at the .05 level (one-tailed test 
** Significant at the .02 level (one-tailed test) 

of the response amplitudes was significantly 
less in the nonavoidance condition, it was not 
appropriate to combine the two experimental 
conditions in a common analysis of variance. 
Instead a modified three-way analysis (groups, 
the two generalization stimuli, and individ- 
uals) was performed separately for the avoid- 
ance and nonavoidance conditions. The results 
of these analyses are summarized in Table 1. 

For both experimental conditions, dif- 
ferences between the hysteric and psychas- 
thenic groups were significant beyond the .05 
level for a one-tailed test in the predicted 
direction. In neither condition was there a 
significant difference between the generaliza- 
tion stimuli. Nor were the interactions sig- 
nificant. In Table 2 will be found the means 
of the individual S means by groups and ex- 
perimental conditions along with their ac- 
companying ¢ tests. 

Because of a significant difference in the 
variance of responses under the avoidance and 
nonavoidance conditions, it was not possible 
to compare the difference in over-all generaliza- 
tion between these two conditions by the 
standard / test. Instead the method suggested 
by Cochran and Cox (2) was used; this yielded 
a ¢ value of 5.31, significant beyond the .001 
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TABLE 2 

MEAN RESPONSE AMPLITUDES IN CENTIMETERS BY 

Groups AND EXPERIMENTAL CONDITIONS TO THE 
TRAINING AND GENERALIZATION STIMULI 


j . - 
} : Ca Stet Nonavoidance 
Avoidance Condition Condition 
| — 
, Stimuli Stimuli 
Group en 
| 


land 
2 


0 —- 1 2 com- 
bined bined 
25.0 4.8 (4.7 |9.6 
25.2 |1.1 (1.7 |2.8 


42/2. 28°°|1. 34/1. 85° 


Hysteric 24.0 |14.4 12.7 (27.1 
Psychasthenic|24.1 | 9.5 8.3 17.7 
‘ O1| 2.17°*| 1.81°| 2.18°° 


* Significant beyond the .05 level (one-tailed test). 
** Significant beyond the .02 level (one-tailed test). 


TABLE 3 
MEANS OF THE MEDIAN RESPONSE LATENCIES IN 
HUNDREDTHS OF A SECOND BY GROUPS AND 
EXPERIMENTAL CONDITIONS TO THE TRAINING 
AND GENERALIZATION STIMULI 


Nonavoidance 


Stimuli 


Avoidance 


Group Stimuli 


0 1 


.26 | .23 .34 | .26 | .26 
30 | .24]. 36} .3S | .31 


Hysteric 


Psychasthenic 





level for a two-tailed test. Thus it seems safe 
to conclude that the avoidance condition 
results in greater stimulus generalization. 
The median latency for positive responses 
during the test trials was determined by Ss 
for each stimulus. The means of these median 
latencies are given in Table 3 by group and 
experimental condition. Three trends can be 
noted in this table. First, the latencies to the 
generalization stimuli are shorter than to the 
training stimulus. Second, the response laten- 
cies for the hysterics are consistently shorter 
than for the psychasthenics, and third, the 
latencies for the avoidance condition are 
shorter than for the nonavoidance. While 
none of these trends is supported by statistical 
test, the shorter latencies for the responses to 
the generalization stimuli are similar to the 
findings of Gibscn (9) and Rosenbaum (14). 
They are probably to be expected since the 
less time an S takes for discrimination, the 
greater the likelihood of a wrong response. 


DISCUSSION 
The above results are in good agreement 
with the prediction. The hysterics under both 


CHARLES W. ERIKSEN 


avoidance and nonavoidance conditions show 
a greater degree of stimulus generalization 
than the psychasthenics. The demonstration 
of a difference between these two groups, how- 
ever, does not insure that our theoretical 
basis for expecting this difference was correct. 
While the prediction for the present study was 
arrived at from the assumption of a difference 
in the way hysterics and psychasthenics 
respond to the internal cues of anxiety, other 
explanations are readily available for the 
present results. Further study of such extreme 
groups is required before we can be confident 
as to the factors underlying the differences in 
performance. 

Irrespective of what may be the basis for 
the obtained differences in stimulus generaliza- 
tion, the present findings are in keeping with a 
growing amount of experimental work show- 
ing the relation of the hysteria-psychasthenia 
variable to various aspects of performance 
under stress. In addition to the previously dis- 
cussed relation of this personality variable to 
learning and memory performance, Davis 
(3, 4) has shown characteristic differences in 
performance on a psychomotor task between 
hysterics and psychasthenics when stress was 
induced by rapid pacing on the task. Results 
closely related to those of the present study 
are reported in an experiment by Deese, 
Lazarus, and Keenan (5). They found that Ss 
high on the MMPI hysteria scale showed 
impairment in nonsense syllable learning under 
stress produced by electric shocks, whereas 
the performance of Ss high on the psychas- 
thenia scale was little affected. 

The success of the hysteria-psychasthenia 
variable in predicting various aspects of 
performance under different stress conditions 
would appear to establish it as an important 
dimension of personality. As such there is a 
need for a clearer definition of the variable in 
personality and behavioral terms. In a previ- 
ous article (6) the similarity of the hysteria- 
psychasthenia variable and Eysenck’s factor 
of introversion-extroversion (7) was pointed 
out, but as yet there is insufficient evidence 
for positively identifying it with Eysenck’s 
factor. 

It is interesting to note that the composite 
hysteria-psychasthenia scale used in the 
present study is quite highly correlated with 
the Taylor scale of manifest anxiety (r = 
—.72, N = 120). The magnitude and direc- 
tion of the correlation are such as to show that 
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Ss on the hysteria pole are low scorers on the 
anxiety scale and those on the psychasthenia 
pole are high scorers. Taylor (16) and Taylor 
and Spence (18) have shown that high- and 
low-anxiety groups selected on the scale of 
manifest anxiety differ in performance in 
conditioning and verbal maze learning. They 
have interpreted these results along Huillian 
lines with anxiety being considered as a 
generalized drive influencing performance 
through the building of habit strength. 

The present experiment and that of Deese 
et al. (5) suggest that the Taylor and Spence 
interpretation of the anxiety scale is too simple 
an interpretation. If the drive interpretation 
of anxiety were correct, we would have ex- 
pected the hysteric group in the present ex- 
periment to show the least stimulus generaliza- 
tion and the psychasthenic group the most. 
In fact, however, the opposite held true. This 
finding, together with the other evidence men- 
tioned above, suggests that the Taylor scale of 
manifest anxiety is not so much a measure of 
anxiety as it is a measure of a personality 
variable relating to the way individuals re- 
spond to or express their anxiety. 


SUMMARY 


The amount of stimulus generalization under 
experimentally induced anxiety was deter- 
mined in an hysteric and a psychasthenic 
group of Ss, selected on the basis of these 
respective scales in the MMPI. Three dif- 
ferent size squares were used for the training 
and generalization stimuli, and the response 
was a simple horizontal arm movement. Two 
experimental conditions were used. Under the 
avoidance condition Ss were instructed that 
failure to respond rapidly would result in 
their receiving a strong electric shock. Under 
the nonavoidance condition Ss were told that 
they would receive electric shocks randomly, 
irrespective of their performance. In fact, 
however, both conditions had the same 
number of shocks administered in a predeter- 
mined order. 

The hysteric group of Ss showed signifi- 
cantly more stimulus generalization under 
both experimental conditions. Also for both 
the hysteric and psychasthenic groups, the 
avoidance condition resulted in significantly 
more generalization. The results were con- 
sidered in relation to a broad concept of 


stress, and it was suggested that the Taylor 
scale of manifest anxiety was not so much a 
measure of anxiety level as it was a measure 
of how Ss respond to or handle their anxiety. 
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TACTUAL PERCEPTION OF FORM BY THE BRAIN-INJURED'! 


ALAN O. ROSS 


Yale University? 


ost of the tests used by the psy- 
chologist for the diagnosis of brain 
injury employ visual or auditory 
stimuli (8, 9, 17). The neurologist, on the 
other hand, frequently uses tactual stimuli 
for the examination of patients suspected of 
being brain-injured, particularly when the 
lesion is believed to be in the posterior parietal 
area, where injury is said to result in an in- 
ability to recognize forms tactually (astereog- 
nosis) (3, 6, 11). These neurological techniques, 
however, are rarely objective and never quan- 
titative. It is the purpose of this study to de- 
termine whether an objective, quantitative 
tactual test of perception of raised outlines 
of forms can be used to discriminate the per- 
formances of brain-injured from those of 
normal human adults. 
A test was designed on the principle that 
outlines of a form can be presented in varying 


degrees of completeness and that the degree 
of completeness can be expressed in terms of 
the number of tactual stimuli of which the out- 
line is composed and the distance between 


these stimuli. Since brain-injured patients 
have been reported to exhibit impaired tac- 
tual perception, especially when the parietal 
lobe is involved, and since brain-injured pa- 
tients are said to have difficulty in responding 
to two simultaneously presented stimuli (2, 5, 
13, 17), two hypotheses concerning their per- 
formance on the tactual test may be advanced. 

Hypothesis‘ I. Patients with brain injury 
require a greater degree of completeness of a 
form outline than normals in order to recognize 
the form tactually. 

Hypothesis II, Patients with brain lesions 
differentiate less well than normals between 
two forms presented simultaneously through 
the tactual modality. 


1 Part of a dissertation presented for the degree of 
Doctor of Philosophy at Yale University. The valuable 
advice and critical comments of Col. Charles S. Gersoni 
and the members of the doctoral committee, Drs. S. B. 
Sarason, H. E. Rosvold, and J. R. Wittenborn, are 
gratefully acknowledged. 

? Now at Brooke Army Hospital, Fort Sam Heuston, 
Texas 
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METHOD 
The Tests 


The experiment consisted of two test series. In the 
tactual tests, the stimuli were thumbtack heads out- 
lining letters, numbers, and well-known symbols on 
individual plywood boards, presented in one or two 
open-ended boxes which screened the boards from S’s 
view 

Identification series. The main test made use of eleven 
forms which S had to identify. Six of the forms were 9 
in. by 9 in., five were 4 in. by 4 in. in their largest di- 
mension. Each form could be presented in four steps of 
decreasing order of difficulty. The first step of each form 
consisted of the outline of the form made up of the 
least number of tacks which could be used without de- 
stroying the essential characteristics of the form. On 
each subsequent step the outline consisted of more tacks 
until, with the fourth step, the form was completely 
outlined by adjoining tacks. The four steps thus dif- 
fered in the number of tactual stimuli and the distance 
between these stimuli. 

Each of the 11 forms of the main test was followed 
by a lactual multiple-choice recognition test bearing 
upon the form which it followed. One of these forms 
was a miniature wire replica of the form represented by 
the particular set, the other three were different forms 
which had some aspect, such as a distinct angle, in 
common with the form represented in the set. 

A visual multiple-choice recognition test was pre- 
sented once, at the end of the identification series. This 
test consisted of a card on which the 11 forms used in 
the main test were reproduced in line drawings. The 11 
test forms were randomly scattered among 29 other 
forms from which they were to be chosen. 

Discrimination series. This consisted of ten paired 
figures of which each pair was presented simultane- 
ously. Using both hands, S had to say whether the two 
figures were the same or different. This series consisted 
of letters or numbers outlined with tacks, placed adja- 
cent to one another. Five pairs contained identical 
forms, while the other five contained forms which were 
similar in some respect. 


Procedure 


Subjects were examined individually. Each S$ was 
told that the examination was in the nature of a survey, 


§ Details of the procedure as well as illustrations and 
descriptions of the stimulus boards, the boxes, and other 
material are contained in an appendix which has been 
deposited with the American Documentation Institute. 
Order Document No. 4164 from ADI Auxiliary Publi- 
cations Project, Photoduplication Service, Library of 
Congress, Washington 25, D. C., remitting in advance 
$2.00 for 35 mm. microfilm or $3.75 for photocopies. 
Make checks payable to Chief, Photoduplication Serv- 
ice, Library of Congress. 
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assurance being given that the results would not in any 
way bear upon his status as a patient or as a soldier.‘ 
He was then asked to indicate his dominant hand and 
to place this hand in one of the screening boxes. A brief 
tactual screening test was given in which S had to 
identify by touch three out of four items of everyday 
use. 

Identification series, main test. For this test the in- 
structions stressed the fact that each of the boards had 
a figure or a form on it, that these forms had a meaning 
which could be described, and that S was expected to 
tell what the figure was and to reproduce it by drawing. 
The S was assured that the form was always a single 
object which could be recognized if seen. Two sample 
boards were shown visually to acquaint S with the con- 
struction of the forms. A time limit of one minute was 
set for each board and S was encouraged to guess if 
necessary. 

The first step of a form was placed in the box. As 
soon as S gave a response, or at the end of one minute, 
the board was withdrawn and S was asked to draw 
what he thought the form looked like. If S identified the 
form correctly,’ either by naming or by drawing, or 
both, the factual multiple-choice board was placed in the 
box and S was asked to examine the four figures and to 
point to the one which had the same shape as the main 
test form just completed. This was followed by the first 
step of the next form in the series. 

An incorrect response or a failure to respond to a 
board on the main test was followed by the next step 
in the set of four boards until either a correct response 
was obtained or the time limit for the fourth step had 
expired, and S had thus failed to recognize the particu- 
lar form on any of its four steps. In this case the next 
form in the series was presented, starting with the 
first step. 

After completion of the last form of the main test, 
the visual mulliple-choice recognition test was presented 
and S was instructed to point to all the forms he remem- 
bered having worked on. A time limit of half a minute 
was set for this test. This completed the identification 
series 

Discrimination series. For this series S was instructed 
to place one hand in each box and to state whether the 
two forms on the boards he was to feel were the same or 
different. Time limit for each pair was half a minute. If 
S delayed his response, he was encouraged after 23 sec 
onds; after a response was obtained or the time limit 
had expired, the boards were replaced by the next pair 
in the series. 

After completion of the discrimination series, S was 
given the Bender-Gestalt Test (1, 12) as part of another 
study and the CVS Individual Intelligence Scale (10). 
The CVS contains the Comprehension and Similarities 
subtests of the Wechsler-Bellevue scale (20) and a short 
vocabulary test, based on the Stanford-Binet word list 
(16, 18). The entire experimental procedure never 
lasted longer than 1 hour and 45 minutes. 


Scoring 
The main test of the identification series was scored 
by assigning a value of 4 to a correct response on the 





4The verbatim instructions are contained in the 
appendix (see footnote 3). 

5 For criteria see section on scoring and appendix 
(see footnote 3). 
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first step of a form; a value of 3 to a correct response 
on the second step; 2 for the third; and 1 for the fourth. 
Failure to give a correct response on all steps of a form 
earned the score of zero.* Since one form was used as an 
introduction and was not scored, the maximum total 
score for the main test of the identification series was 
4), the minimum score zero. This score will be referred 
to as the main score in the following discussion. 

On the tactual multiple-choice recognition test, a 
score of 1 was given for each correct choice, leading to a 
possible maximum score of 10. The visual multiple- 
choice recognition test was scored by giving 1 point for 
each design correctly recalled. The maximum score for 
this test was 11. Each of these subtests was scored and 
analyzed separately and no wver-al! total score was 
calculated for the series. 

The discrimination series was scored by crediting 1 
point for each correct discrimination, for a total possible 
score of 10 


Subjects 

The Ss were patients from the neurosurgical, neuro- 
logical, and rehabilitation services of three Army hos- 
pitals. The test was administered to every patient 
whose major diagnosis was craniectomy, regardless of 
cause, and who was present during the week in which 
the hospital was visited. A total of 31 patients were 
examined. Seven of these had to be discarded before 
analysis of the data, leaving a sample of 24.’ Discards 
were necessitated by the diffuse nature of some of the 
lesions which made localization impossible, by the pa- 
tients’ emotional state interfering with testing, or by 
the fact that diagnoses on which the initia] selection had 
been based had implied surgical procedures which, upon 
closer study of the medical history, were revealed as not 
having taken place. 

Detailed abstracts of the medical and operative his- 
tories of the patients were obtained and, wherever pos- 
sible, the Army General Classification Test score (19) 
or the Armed Forces Qualification Test score was se- 
cured from personnel records. The site of the lesion was 
ascertained by close scrutiny of operative notes and 
from skull X rays, encephalograms, and surgeons’ state- 
ments. 

The normals were obtained from the service person- 
nel stationed at an Army hospital. In order to find a 
match for each patient, controlling for intelligence, age, 
years of Army service, and years and geographic region 
of education, 42 normals were selected from the avail 
able population by reference to a personnel roster. 
These Ss were screened for histories of trauma or dis- 
ease which might have affected their brain structure.® 
Five cases had to be eliminated because some residual] 
brain damage was suspected, leaving 37 normals who 
were tested. 

From this pool of normals, a match for each of the 
24 patients was drawn. The intelligence measure used 
for the matching was the CVS Individual Intelligence 
Scale (10). The data pertaining to the matching varia- 





® Methods and criteria used in scoring are described 
in the appendix (see footnote 3). 

7? Detailed information on the 24 patients and on the 
discarded cases is contained in appendix Tables 1 & 2 
(see footnote 3). 

® See check list in the appendix (see footnote 3). 
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TABLE 1 
MATCHING VARIABLES 
24 pairs; variables matched for same 
pairs throughout) 


(N = 


Variable Mean 

Intelligence 
Normals 
Patients 

Age (years) 
Normals 
Patients 

Education (years) 
Normals 
Patients 

Service (years) 
Normals 
Patients 


78 1.02 
79 |1.12 
.24 |1.68 


.76 |1.54) .74 


7 
2 
Note.—None of the differences is significant. All of the cor- 
relations are significant below the .01 level 

bles are summarized in Table 1. The mean difference ¢ 
test and the F test for homogeneity of variance indicate 
that the two groups were probably drawn from the same 
population. The high product-moment correlations 
show that the individuals within the groups were 
closely matched. 

Civilian occupations of normals and patients were 
similar in distribution,® so that a possible bias because 
of occupational experience with tactual tasks could be 
ruled out. All Ss passed the tactuai screening test, thus 
eliminating peripheral anesthesia as explanatory of the 
findings. 

RESULTS 

Identification series, main lest. According to 
the principle on which the test was designed, 
the main score is an expression of the degree of 
completeness required by an S in order to 
recognize a tactually presented form outline. 
Should the patient group score significantly 
lower than the normal group, Hypothesis [, 
that patients with brain injury require a 
greater degree of completeness of a form out- 
line than normals in order to recognize the 
form tactually, would be confirmed. The 
main-score mean for the patients is signifi- 
cantly below the mean for the normals with a 
p below .01 (Table 2), supporting Hypothesis 
Fine 

Both verbal and graphic responses were 
credited as correct in order to avoid a handicap 
to the patients who might have had verbal 

* See appendix Table 3 (see footnote 3) 

1 Since the hypotheses predict differences in one 
direction, one tail of the distribution of ¢ was used in 
all comparisons deriving directly from the hypotheses. 
All other tests of significance are based on two tails of 
the distribution. 


difficulties of an aphasic nature. The number of 
Ss in each group who drew the figures before 
naming them should indicate the extent of 
such a handicap. A chi-square analysis shows 
that there is no significant relationship be- 
tween this manner of responding and member- 
ship in the patient group (chi square = .82, 
df = 2). 

The lower main score of the patients might 
be the result of their slowness in responding. A 
comparison of the patients’ mean response time 
per board with that of their respective controls 
indicates that there is no significant difference 
between the groups (Table 2). A further com- 
parison, relevant to the question of slowness, 
deals with the number of times Ss had boards 
withdrawn from them because of expiration of 
the time limit. A chi square of .003, df 2, 
indicates that there is no association between 
brain injury and slow performance. The time 
factor was apparently not responsible for the 
poorer performance of the patients. 

Patients had significantly lower scores than 
normals when only the first five forms of the 
main test of the identification series are con- 
sidered (Table 2). Since this includes no more 
than 20 minutes of testing time, the fatigue 
factor was apparently not responsible for the 
lower score of the patients on the test as a 
whole. 

The correlation between the main scores 
and the CVS scores, which represent a post- 
injury intelligence measure for the patients, 
is .53 (p < .01) for the normals and .26 (p 
> .10) for the patients. The two correlations 
do not differ significantly (z = 1.06, p = .29). 


TABLE 2 
SUMMARY OF SCORES FOR IDENTIFICATION SERIES 


Mean 


Normals | Patients 
N = 24)|(N = 24) 


Main 25.1 20.1 .44** 
First 5 forms (main) 16.1 13.9 .63** 
Mean time per board 41.8 43. my 
Tactual multiple choice 8.6 i .50 
Visual recognition 

Number right 

Number pointed to 

Number wrong 

Accuracy ratio 17 


4 

5.3 

1.5 | 1.99 
| 173 02 


Note.—Tests of significance based on one tail of distribution 
for first three comparisons; on two tails for tactual multiple-choice 
and visual recognition data. 

** Significant at the .01 level. 
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Tactual multiple-choice recognition test. The 
score on this test is a measure of immediate 
memory. The comparison reveals no significant 
difference between patients and normals 
(Table 2). 

Visual multiple-choice recognition test. Pre- 
sented at the end of the identification series, 
this test is a further measure of memory. 
Patients chose significantly fewer correct 
figures than normals (Table 2). The number of 
correct figures chosen is subject to the effect 
of several variables. The greater the total 
number of figures an S points to within the 
half minute available, the greater will be the 
probability of his including figures which are 
scored as correct. Normals pointed to sig- 
nificantly more figures than did the patients 
(Table 2). When the number of correct figures 
is weighted by the total number of figures 
pointed to, the resulting accuracy ratio fails 
to reveal a difference between patients and 
normals (Table 2). It appears that the greater 
number of correct figures chosen by the nor- 
mals is a result of their pointing to more 
figures within the time available. It may be 
possible to correct for this in the future by 
requiring a limited number of choices. Normals 


also pointed to more wrong figures than pa- 
tients, but this difference is not significant 
(Table 2). 

Occasionally an S failed to recognize a form 
on all four steps of the identification series’ 
main test. Patients tend to have more such 
failures than normals, as indicated by a chi 


square which approaches significance (chi 
square = 9.37, df = 4, p > .05). This raises 
the possibility that patients were not able to 
make as many correct choices as normals be- 
cause they were familiar with fewer forms 
which they might recognize. Having failed to 
recognize the letter R tactually, for example, 
they were unable to recognize the R on the 
visual multiple-choice test. Accordingly, per- 
formance on the visual multiple-choice test 
was rescored considering only those forms 
which an S had recognized on either the main 
test or the tactual multiple-choice test of the 
identification series. The percentage scores so 
obtained indicate that normals performed 
significantly better than patients (p < .01 
on Wilcoxon’s paired replicates test [21]). 
Discrimination series, main test. This test 
was designed as a test of Hypothesis II, re- 
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garding the differentiation between two simul- 
taneously presented forms. Four patients were 
unable to perform on this test because they 
had only one functioning hand. The remaining 
20 patients had a mean score of 6.8, compared 
to a mean score of 7.6 for the matched normals. 
The difference between the two means is 
significant at the .05 level with a ¢ of 2.10, 
supporting Hypothesis II. Further analysis 
indicated that Hypothesis IT is related to Hy- 
pothesis I. The discrimination scores corre- 
late with the identification main scores with an 
r of .47 (p < .05) for the 24 normals and with 
an r of .48 (p < .05) for the 20 organics. It is 
possible to evaluate the difference between the 
groups on the discrimination series, holding 
tactual identification ability, as measured by 
the main score, constant. A ratio which meas- 
ures performance on the discrimination series, 
independent of performance on the main test 
of the identification series, was obtained by 
dividing an S’s standard score for discrimina- 
tion by his standard main score on identifica- 
tion. The mean difference for this ratio, pa- 
tients versus normals, is not significant with a 
t of .25. 

The parietal area is said to be critical for 
tactual recognition of form (6, 11). An attempt 
was, therefore, made to determine whether 
patients with parietal lesions required a more 
complete outline of forms than patients with 
lesions in other areas in order to recognize 
the form tactually. The patient group con- 
tained only three cases with lesions restricted 
to the parietal area, but a group of 10 patients 
could be formed when all other patients where 
the parietal area was involved in the injury, 
but not exclusively, were added to the purely 
parietal cases. Only four of these cases had an 
injury in the dominant hemisphere and used 
the dominant hand. The mean main score of 
this heterogeneous group of 10, who had some 
form of parietal involvement, is significantly 
lower than the mean main score of the re- 
maining 14 patients where the parietal area 
was not involved in the injury (¢ = 1.80, 
df = 22, p < .05). The fact that the patients 
in the parietal group had injuries of various 
sizes implicating other areas to varying de- 
grees makes it impossible to attach much 
weight to this result. 

If tactual perception is considered as primar- 
ily localized in the parietal area, it might be 
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suspected that the presence of parietal cases 
in the sample of patients resulted in the low 
score of the patient group on the test on which 
the answer to Hypothesis I was based. In line 
with Hypothesis I, a new comparison of the 
main scores for the identification series was 
made, using only the 14 patients who had no 
parietal involvement. The mean for this group 
was 22.0, compared to a mean of 25.3 for their 
normal controls. The difference is significant 
at the .05 level (¢ = 2.62), indicating that the 
difference between patients and normals re- 
mains when all cases with parietal involve- 
ment are excluded so that the conclusion 
relevant to Hypothesis I is not changed. 

Differences in test performance are fre- 
quently attributed to differences in the length 
of time since operation. The mean time since 
operation (not including cranioplasties) was 
7.9 months, with a standard deviation of 3.3; 
the median was 8 months, the range from 2 
to 15 months. To test for a possible dependence 
between time since operation and main score 
on identification series, a chi-square analysis 
was made, dichotomizing the patient group 
at the median of time since operation and at 
the median of identification main scores. A chi 
square of .001, df = 1, indicates that there is 
no significant tendency for long or short time 
since operation to vary with high or low main 
scores. When a Pearsonian correlation coef- 
ficient was computed for length of time since 
operation and main score, the nonsignificant r 
of .11 (p = .70) pointed toa similar conclusion. 

Inasmuch as the patients took the Army 
classification test at the time of their induction 
into the service, the scores they obtained at 
that time represent a measure of their pre- 
morbid performance, while the CVS scores 
measure their performance after injury. The 
Army classification scores are based on one 
of two tests, the Army General Classification 
Test (AGCT) (19), or the Armed Forces 
Qualification Test (AFQT), which has super- 
seded the AGCT in recent years. The two tests 
correlate on the order of .92, and the scores 
are comparable in terms of the performances 
measured. In the following, classification test 
scores will be referred to as ‘“‘AGCT” scores, 
regardless of which of the two tests they were 
based on. 

Alternate basis for matching. By matching 
the groups on the basis of the patients’ post- 
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TABLE 3 
Groups MATCHED FOR PREMORBID INTELLIGENCE 


Mean 
Comparison t 

Nor- Pa- 

mals tients 
“AGCT” score 59.0 59.6 .62 
Age 24.0 22.8 1.27 
Education (years) 11.3 | 10.6 2.00 
Identification (main) 25.8 20.4 3.43** 
Discrimination 7.5 6.8 1.81* 


Note.—N = 20 for each group, except for discrimination scores 
where NV = 16 for each group. Tests of significance based on two 
tails of distribution for first four comparisons, on one tail for 
analysis of identification and discrimination scores. 

* Significant at the .05 level. 

** Significant at the .01 level 


injury intelligence scores it was shown that the 
lower tactual scores of the patients were in- 
dependent of any general lowering of mental 
ability. In order to investigate differences in 
tactual scores between patients and normals 
when preinjury mental ability is held constant, 
a new matching was accomplished by pairing 
each patient with a normal who had the same 
“AGCT” score. Age and years of education 
were controlled for at the same time. Table 
3 shows that the mean differences for the 
matching variables were not significant. When 
the principal comparisons relevant to Hypo- 
theses I and II are repeated with preinjury 
intelligence held constant in this manner, the 
differences in identification main scores and 
discrimination scores remain significant at the 
.01 and .05 levels respectively. 


DISCUSSION 


A tactual test was designed on the principle 
that outlines of a form can be presented in 
varying degrees of completeness and that the 
degree of completeness may be expressed in 
terms of the number of tactual stimuli of 
which the outline is composed and the distance 
between these stimuli. Two hypotheses con- 
cerning the performance of the brain-injured 
on this test were advanced. The hypothesis 
that patients with brain injury require a 
greater degree of completeness of a form out- 
line than normals in order to recognize the form 
tactually was confirmed. The hypothesis that 
patients with brain lesions differentiate less 
well than normals between two forms presented 
simultaneously through the tactual modality 
was also confirmed. 

















TACTUAL PERCEPTION OF FORM BY THE BRAIN-INJURED 


With the present form of the test, the 
scores of patients and normals have too great 
an overlap for the technique to be valuable 
for differential diagnosis of brain injury, but 
the study suggests that with greater refine- 
ment and extensive standardization, a test 
of this kind may be made clinically useful. 

The results of the study are relevant to a 
number of theories, opinions, and practices 
current in the field of brain injury. One of 
these is the view that disturbed tactual recog- 
nition of form is associated with a lesion in the 
parietal area of the cortex (3, 6, 11). The data 
obtained in this experiment indicate that func- 
tions as complex as those required in the 
tactual test used can be interfered with by 
lesions in other areas as well. Inferior tactual 
ability can be found in patients who have 
undergone brain operations, grouped without 
regard to the cerebral area affected, although 
there is indication that patients with lesions 
involving the parietal area exhibit a greater 
decrement than patients with lesions in other 
areas. 

It would appear that the technique em- 
ployed in the present study is sensitive to dis- 
turbances of a general psychological function, 
under which tactual recognition of form can be 
subsumed. If the function of the intact cere- 
brum is viewed as one of integration or syn- 
thesis (4), the present findings might be 
interpreted as indicating a disturbance of the 
integrative function of the brain-injured sub- 
jects, manifesting itself in poor test perfor- 
mance. 

The present findings are further relevant 
to the statements by several writers (2, 5, 13, 
17) that patients with brain injuries experience 
difficulty when required to differentiate be- 
tween two simultaneously presented stimuli. 
It was shown that normals perform better 
than patients when they had to differentiate 
tactually between two simultaneously presen- 
ted forms. The patients’ deficit on the dis- 
crimination task can, however, be reduced to 
the more basic deficit which operates in 
tactual recognition of form so that simul- 
taneity, as such, does not seem to be of primary 
importance in reducing the performance level 
of brain-injured. 

Impulsiveness is another characteristic oc- 
casionally ascribed to brain-injured (14, 15). 
The data for the visual multiple-choice recog- 
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nition test show that normals are significantly 
more productive than patients in terms of the 
number of forms pointed to within the time 
limit, but the greater number of wrong forms 
chosen suggests a lack of precision on the part 
of the normals. While patients point to fewer 
forms, the forms they do point to are more 
often correct than the many forms pointed to 
by the normals, so that the relative accuracy 
of the two groups is the same. The evidence 
thus points against impulsiveness on the part 
of the patients. It has also been suggested that 
brain-injured patients restructure their en- 
vironment on the level of their reduced capaci- 
ties (7). Such a phenomenon may be involved 
in the findings on the visual multiple-choice 
task. Patients appear to have reduced the 
quantity of their performance, thus main- 
taining the quality. 

It might be argued that the psychological 
reactions of the patients to the experience of 
brain surgery and hospitalization resulted in 
their lower tactual scores. Several factors 
refute such an explanation. The patients and 
the normals reacted to the testing situation in 
essentially the same manner. The instructions 
aimed at reducing test anxiety seemed to have 
had the desired effect on all but one patient, 
and he was eliminated. Besides this clinical im- 
pression, there is quantitative evidence which 
speaks against the presence of the so-called 
catastrophic reaction in the patients. Anxiety 
might slow down the performance of the pa- 
tients so that they would exceed the time 
limit significantly more often than the normals 
or show a longer mean response time per board. 
This was not the case. The opposite effect, 
that patienis were giving rash and indiscrim- 
inate responses in order to escape the situation, 
is disproved by the mean response time data 
and by the data on the visual multiple-choice 
test, where patients showed more accuracy 
and discrimination of choice than normals. 

It might be argued alternatively that the 
tactual scores are an expression of intelligence 
and that they might have been predicted from 
a knowledge of intelligence scores based on 
more traditional tests. Since the differences 
between patients and normals predicted by 
the hypotheses prevailed both when preinjury 
and when postinjury intelligence was held 
constant, it would appear that the tactual 
scores are relatively independent of intelligence 
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scores. It is pessible that for diagnostic pur- 
poses a knowledge of both tactual and intel- 
ligence scores would be useful. 


SUMMARY 


A tactual test of perception of raised out- 
lines of forms was designed on the principle 
that such outlines can be presented in varying 
degrees of completeness and that the degree of 
completeness can be expressed in terms of the 
number of tactual stimuli of which the outline 
is composed and the distance between them. 
The power of this test to discriminate the per- 
formances of 24 brain-injured patients from a 
like number of normal adults was examined. 
Patients with brain injury were shown to re- 
quire a greater aegree of completeness of form 
outline than normals in order to recognize the 
form tactually. When two stimuli were pre- 
sented simultaneously for tactual discrimina- 
tion, patients performed worse than normals, 
but when tactual recognition of form was held 
constant, the simultaneity factor appeared to 
contribute little to the difference between 
patients and normals. It was suggested, that 
the technique might be refined and standard- 
ized to make it useful for differential diagnosis 
of brain injury. 
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THE CONSISTENCY OF INTERPERSONAL BEHAVIOR JUDGMENTS 
MADE ON THE BASIS OF SHORT-TERM INTERACTIONS IN 
THREE-MAN GROUPS! 


ROBERT R. BLAKE, JANE SRYGLEY MOUTON, anp BENJAMIN FRUCHTER 
The University of Texas 


IRECT interpersonal judgments made 

by strangers on the basis of short- 

term interactions may provide one 
basis for measuring individual personal and 
social characteristics of the kind that cannot 
be easily evaluated through objective tests. 
Is such behavior sufficiently consistent and 
can such judgments be made with sufficient 
reliability to justify their further consideration 
in connection with technical problems such as 
assembling individuals into effective work 
groups? Since previous studies (e.g., Barker 
[1], Bell and French [2], and Haythorn [3)) 
bear on this issue only indirectly, it seems 
desirable to seek a direct answer to these 
questions. 

This paper reports an experiment designed 
to determine the extent to which one technique 
of direct assessment produces reliable judg- 
ments concerning the personal and social 
characteristics in individual behavior. The 
approach to estimating the reliability employed 
in this investigation is somewhat different 
from the usual method. In the more conven- 
tional reliability study conditions in the 
first administration are repeated in the second 
administration. In the present investigation 
the estimate of reliability obtained is depend- 
ent not only on the reliability of the judgments 
but also on the consistency of the behavior 
and similarity of the group structures in the 
two situations. Since these three sources of 
variation are confounded in the design, high 
consistency may be interpreted as indicating 
little variation in any of the three sources. 
In cases of low agreement, however, it cannot 
be determined which of the sources of variation 
account for it. The values obtained therefore 
can be considered estimates of the amount of 


1 This research was supported in part by the United 
States Air Force under Contract Number AF 18(600)- 
602, monitored by the 3305th Research and Develop- 
ment Group (Combat Crew Training Research 
Laboratory), Human Resources Research Center, 
Randolph Air Force Base, Texas. Permission is granted 
for reproduction, translation, publication, use, and 
disposal in whole and in part by or for the United 
States Government. 


consistency to be expected over situations 
where raters, tasks, and groups vary. 


EXPERIMENTAL DESIGN 


The general plan called for direct assessment 
and observer evaluation of the behavior of each 
of 33 subjects who interacted as members of 
three-man groups in two sessions. Changes 
between the two group sessions included 
varying the task on which members worked, 
the composition of the groups, and the identity 
of the observers. These changes were introduced 
by the use of the following experimental plan. 

The general plan shown in Table 1 called 
for eleven interaction groups for each of two 
tasks. Every group consisted of three members 
who were unacquainted with one another 
at the beginning of the two discussion periods, 
with the same individuals participating in 
different groups during both periods. That is 
to say, in the second discussion period the 
membership of the groups was changed so that 
the three who interacted with one another 
during the second period were also unac- 
quainted with one another at the beginning 
of their interaction period. In addition the 
interaction was evaluated by trained observers 
whose schedules were arranged so that no 
observer watched the same person functioning 
in two different sessions. 

The behavior of individuals composing each 
group was evaluated in two ways. At the end 
of both discussion periods the three members 
filled in a 12-item direct-assessment question- 
naire describing their own behavior and 
that of the other two members. The observers 
also filled in the same questionnaire to describe 
the behavior they had observed in the mem- 
bers. The rankings received on these question- 
naires constitute the basic data for assessing 
the consistency of behavior between and 
within the two discussion periods. 


METHOD 
Subjects 


The Ss were 33 college students. Two restrictions 
were imposed in selecting them. The first was that they 
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TABLE 1 
EXPERIMENTAL PLAN Usep TO INTRODUCE VARIATIONS 
in Task, Group CoMPOsITION, AND IDENTITY OF 
OBSERVERS FOR Two SEssIONS INVOLVING 
THE SAME 33 SUBJECTS 





Group No. 
Session "see 
I II Ill IV XI 
Members ABC DEF GHI JKL 
Observer i 2 3 4 
Members AEI BF J CGK DHL ete 
Observer 4 3 2 1 


should not be acquainted with one another prior to the 
beginning of the study. The second was that they 
should be undergraduate students, a restriction intro- 
duced in order to insure homogeneity in the sample of 
Ss who were being asked to rate one another. Twenty- 
four of the Ss were men and nine were women. 
Groups were arranged so that men and women worked 
separately 


Observers 


The observers (Os) were 11 members of a class in 
social psychology. They were pretrained for making 
these kinds of observations by participating as Ss in 
three previous discussion groups where they filled out 
questionnaires similar to the ones used in this study.” 


Group Task and Procedure 


Two tasks were used to produce group discussion. 
The groups were instructed to discuss the assigned 
topics, to come to agreement regarding a solution, and 
to write a brief report concerning the agreement 
reached. The first discussion was based on the question, 
“Should college grades be abolished?” The second 
task required the three Ss to construct one meaningful 
sentence which contained the following three words, 
to, too, and two. The second task was not so satisfactory 
as the first for producing interaction. After 15 minutes 
of interaction on the first task the groups were broken 
up and each individual was given a questionnaire. It 
included 12 items in terms of which each participant 
could rank himself and the two other members. Items 
were selected to permit assessment of personal and 
social characteristics that appeared in the small group 
discussions. In each case S was required to rank himself 
as one of the three members. The items used are shown 
in Table 2 

On completion of this phase, new three-man groups 
were created so that each individual interacted with 
two individuals different from those who had been 
members of his first group. After a 15-minute discussion 
period on the second task these groups were broken 
up and the same questionnaire was administered to 

* The class members who conducted the sessions and 
served as Os were Richard Austin, Dorothy Babin, 
Robert Bell, John Bowen, Jack Brackeen, Jack Hain, 
Julian Hill, Bette Perry, Glenn Rosson, Alice Smith, 
and Jane Woodbridge 
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them. After both sessions, Os filled in the same ques- 
tionnaire to judge the personal and social characteristics 
of the three group members. Each O ranked three 
different people on each of the two occasions, making 
their rankings without knowledge of the rankings 
made by the group members themselves or by other Os. 
Control of the previously mentioned factors pre- 
cludes the possibility of obtaining high consistency 
between rankings due to the introduction of factors 
such as previous acquaintance, recall on the second 
occasion of rankings made of other members on the 
preceding occasion, or continuity in performance based 
on discussion of the same topic. Significant agreement 
between rankings can be interpreted as due to consist- 
ency in the judgments of the behavior of Ss as evaluated 
by the technique employed. Absence of significant 
agreement cannot be interpreted as an inconsisteacy of 
judgments since the lack of agreement may be due to 
differences in the tasks or structure of the groups. 


Statistical Treatment of Data 


The distributions of rankings were trichotomized 
and chi square was used as the statistical test in making 
the various item comparisons, such as between ad- 
ministrations, between Ss, between Os, and so on, 
and the contingency coefficient (C) was computed as 
the index of the degree of relationship. A limitation 
of the scoring method is that the rankings within 
any one group are not independent of one another 
since ties were not permitted. The contingency coeffi- 
cient was preferred to the Pearson product-moment 
coefficient since it seemed improbable that the distribu- 
tion characteristics needed for applying the latter 
method, i.e., continuous normal distributions, were 
obtained in the data. They do meet the conditions for 
applying the chi-square test, however, with the excep- 
tion of low expected frequencies in some cells. 


RESULTS 
The Consistency of Judgments between Sessions 


The design of the study permits determina- 
tion of the consistency of judgments between 
discussion periods by comparing rankings 
received on the questionnaire during the 
first period with those received during the 
second. It will be recalled that the people Ss 
interacted with on the second occasion were 
different from those on the first, and that no S 
was acquainted with the other two members at 
the beginning of the two interaction periods. 
Furthermore, the topic of discussion was 
different. 

The consistency of judgments between 
sessions can be looked at from different points 
of view including the individual’s evaluation of 
his own behavior (self-self), the evaluation of 
his behavior by two other group members with 
whom he was participating (member-member), 
and the Os’ evaluations of his behavior 
(observer-observer). In addition the combined 
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TABLE 2 


Irems USED FOR 


Instructions: This questionaire is to be filled out as you 


THE Drrect ASSESSMENT OF INDIVIDUAL BEHAVIOR 


saw the members of your group interacting during this 


period. For each item you are to rank the members of the group ir order of preference. Use the number of each 


person to identify the members. Include yourself in the ranking. 





1. Rank the group members in terms of who contributed most to the group decision. 


Abbreviation used 
in text 
Contribution to 
group decision 


2. Rank the group members in terms of who did most of the clerical work. Clerical work 

3. Rank the group members in terms of who most often functioned as the leader. Leadership 

4. Rank the group members in terms of who did most to keep the group “on the ball.” On bail 

5. Rank the group members in terms of who was most agreeable and willing to go along Agreeable and will- 

with what the others wanted. ing 

6. Rank the group members in terms of who was the most dominant in the group situation. Dominant 

7. Rank the group members in terms of who most wanted to do a good job. Good job 

8. Rank the group members in terms of who was the most cooperative. Cooperative 

9. Rank the group members in terms of who was most satisfied with the group decisions. Satisfied with 

group decision 

10. Rank the group members in terms of who was most frustrated by the group situation. Frustrated 
11. Rank the group members in terms of who showed the most interest. Interest 
12 Yielded 


TABL 


. Rank the group members in terms of who yielded most to others. 


E 3 


CONSISTENCY OF JUDGMENTS BETWEEN SESSIONS 


Self-Self 
Item 
v p Cc r 
1. Contribution to group 3.60; — _ .31 | 9.57 
decision 
2. Clerical work 10.01 .05 | .48 | 4.61 
3. Leadership 20.92 .01 | .62 | 9.33 
4. On ball 10.39 .05 .49 | 6.37 
5. Agreeable and willing 3.08, — , .29; 5.57 
6. Dominant 13.18 .02, .53 | 3.76 
7. Good job 7.14 .20| .32 | 4.51 
8. Cooperative 8.16 .10 .45 | 1.80 
9. Satisfied with group de- 10.22 .05 .49 | 2.85 
cision 
10. Frustrated §.25,; — 7 | 6.35 
11. Interest 11.45 .05 51 9.71 
12. Yielded 5.04 — | .36 5.17 


ratings of the three group members (the S’s 
self-judgments and the judgments by the two 
other members) were used as a basis for 
evaluating the individual’s behavior on a 
group basis (group-group). The statistical 
summaries for these comparisons are shown in 
Table 3. 

An examination of Table 3 shows that 
there is some consistency in judgments as 
assessed by certain items on the questionnaire. 
By way of summarizing thefindings reported in 
Table 3, it seems clear that reliable judgments 


Member-Member 


Observer-Observer Group-Group 





p C x? p Cc x? p Cc 
05 .48 7.09: .2: .32 7.07 | .20 | .42 
-- .35 7.64 | .20; .33 | 12.31 | .02 | .52 
10: .47 10.90; .05 | .50 9.48 | .10 | .47 
.20 .40 4.36 .34 | 18.38 , .01 60 
— .38 5.99 .20. .39 6.12 | .20. .40 
— .32 10.36 .05 | .49 19.41 01 61 
— i 2.18 — i 4.42 — 34 
aa 23 | 11.9 | .02 | .52 3.13 _— .29 
— .28 2.73 — .28 2.39 _ .26 
.20. .40 3.81 —- - 6.94 .20 .42 
.05 .48 3.81 — aa 2.68 - ae 
~- .37 2.18 — .25 2.83 — .28 
about characteristics of individual behavior 


can be made even on the basis of extremely 
short periods of interaction. Items like /eader- 
ship (No. 3), interest (No. 11), and dominance 
(No. 6) were reliably judged in a number of the 
comparisons made. This finding supports the 
view that direct judgments about personal 
and social characteristics can be used as 
measures of certain aspects of behavior under 
conditions of short-term interaction when 
conditions in the first administration are 
different from those in the second administra- 
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TABLE 4 








Observer-Group Observer-Group 





Item (1) (2) (1) (2) 
x |p| Cc) pic ¢ |eplc!] 2 | plc 
RONEN ecin, SUNSien ie 7: © _ cae See 
Contribution to group de- | 27.27 | .O1 | .67 | 11.48 | .05 | .51 | 29.03 | .01 | .68 | 15.46! .01 | .57 
cision 

Clerical work 18.54 .01 | .60 | 14.68 | .01 | .56 | 33.27 | .01 | .71 | 24.44] .01 | .65 
Leadership 26.72 | .01 | .67 | 14.94 01 | .56 | 18.85 | .01 | .60 | 20.93 | .01 | .62 
On ball 18.88 | .01| .60| 5.48! — | .38 | 16.92} .01/ .51| 5.96} — | .39 
Agreeable and willing 2.00} — | .24} 5.50} — |} .39 | 10.12 | .05| .48| 9.61] .05 | .48 
Dominant 16.62 | .01 | .58 5.13 | — .37 | 12.05 | .02 | -92 | 14.64] .01 | .55 
Good job 55) — Se 8.72 10 | .46 1.79| — ae 4.05 | — .33 
Cooperative 1.73 | — | .22 2.48; — | .26 8.28 | .10) .45 | 6.47] .20)| .40 
Satisfied with group deci- | 4.20| — | .34| 2.68| — | .27 2.70 | — | .28 1.76} — | .23 

sion } 
Frustrated 2.35| — | .26| 5.72] — | .38| 5.17] — | .37| 2.30] — | .26 
Interest 1.33} — | .20/] 5.33] — | .37 | 6.86 | .20| .42 | 7.59] .20| .43 
YVielded 8.93 | .10 | .46 | 14.40 | .01 | .55 2.68 | _ .27 8.04 | .10 44 





Self-Observer (1) 








Self-Observer (2) 


16.68 














Self-Member (1) 











Self-Member (2) 





Contribution to group de- | 24.28 | .01 | .65 01. .58 | 10.67 | .05 | .49 | 9.99 .05 | .48 
cision 
Clerical work 34.10 | .O1 | .71 | 24.60 | .01.. .65 | 21.59, .01. .63 | 14.14 | .01 | .55 
Leadership 15.61 | .O1 | .57 | 24.37 | .01 | .65 | 11.90 | .02 | .52 | 12.88 | .02 | .53 
On ball 23.75 | .01 | .65 1.60 | — | .22 | 18.46 .01 | .60 2.51 | — | .27 
Agreeable and willing 9.65 | .05 | .48; 3.80; — | .32 7.02 | .20 | .42 | 15.03 | .01 | .56 
Dominant 7.24 20 | .42 | 13.30 .01 | .54 6.44 .20 .40 7.20 | .20 | .42 
Good job 4.33 | — | .34 4.67 | — | .35 3.19; — | .30; 3.15 — | 30 
Cooperative 6.75 | .20| .41 7.12 | .20; .42| 4.50; — | .35 7.75 | .20 | .44 
Satisfied with group deci 40; —/}.11 42); — | il 2.39 — 26 97 | — | .17 
sion 
Frustrated 8.61 | .10 | .46; 3.12; — | :29| 4.37| — | .34| 2.96; — | .29 
Interest 5.41; — | .38 8.29 | .10 .45 1.588; — 21 5.84; — | .39 
Yielded 5.99 | .20 39 4.9 — .36 3.26; — .30 6.94 | .20 | .42 
tion. This generalization seems to be true here is concerned with the relationship 


regardless of whether judgments are made by 
Os, by group members, or by Ss themselves. 
On the other hand judgments concerning 
other personal and social characteristics such 
as yielded (No. 12), desired to do a good job (No. 
7), and satisfied with the group decision (No. 9) 
were not made reliably on a between-session 
basis. While these data demonstrate the 
feasibility of using direct-assessment iech- 
niques to judge certain personal and social 
characteristics, further experimental work is 
needed to determine what kinds of characteris- 
tics are most amenable to reliable observation. 


The Consistency of Judgments within Sessions 


The second approach concerns the degree of 
agreement between different judges looking at 
the same sample of behavior. One question 


between Os’ rankings and judgments of each 
individual by the other two group members. 
Another relates to rankings given by the 
group as a whole to each member as contrasted 
with the judgments given by the Os. The 
difference between these two comparisons is 
that in the latter case the S’s self-judgment is 
included. The same types of questions can be 
asked by comparing the self-judgments with 
the Os’ judgments and by comparing the 
self-judgments with the judgments given 
by the other two group members. The statisti- 
cal summaries for each of these comparisons 
are presented in Table 4. 

An examination of Table 4 indicates con- 
siderable agreement in the assessment of 
behavior on a within-session basis. This agree- 
ment appears when comparisons are made 
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CONSISTENCY OF INTERPERSONAL BEHAVIOR JUDGMENTS 


TABLE 5 


SuMMARY OF LEVELS OF SIGNIFICANCE OF ITEMS BETWEEN AND WITHIN SESSIONS 


Between Sessions 


Item 
‘Observer | Member | Self-Self 

Contribution to group de- II I Ill 

cision 
Clerical work II III I 
Leadership I II I 
On ball Ill II § I 
Agreeable and willing II III Ill 
Dominant I Ill I 
Good job III III II 
Cooperative I Ill II 
Satisfied with group deci- III III I 

sion 
Frustrated Ill II Ill 
Interest III I I 
Yielded III III III 


Within Sessions 


Observer- Observer- 





eho ? Sete 
. Member Self Self Group 
Group- 
Group Se Ree A ee oe en) ar ia ae 
Ses- | Ses- | Ses- | Ses- | Ses- | Ses- | Ses- | Ses- 
sion 1 sion 2| sion 1 | sion2 | sion 1 | sion 2; sion 1| sion 2 
II I I I I I I I I 
I I I I I I I I I 
II I I I I I I I I 
I I | ill I | Il I | if I Il 
II Ill = Ill I | ill II I I I 
I I | i II I Ii II I I 
Ill Ill II | IIL | I | WI | dW | dW | CdD 
Ill II | Ill II II | Ill II II II 
III IIT | INE | IML | I | WI | WT | dW | dA 
II Ill Il II | IT | I | WI | dW | dW 
Ill Ill | Il | Ill II | Ill | iil II II 
Ill II I Ir Ill III II 


III II 





between judgments given by Os and members, 
between members’ judgments and _self- 
judgments, between Os’ judgments and 
self-judgments, and between judgments by 
Os and the group as a whole. 

The tabulation also indicates that consider- 
able agreement obtains between the judgments 
made by independent Os for a single interaction 
situation. Items that produce consistently 
reliable judgments are those involving contribu- 
tion to the group decision (No. 1), clerical work 
(No. 2), and leadership (No. 3). On the other 
hand satisfied with group decision (No. 9) and 
frustrated (No. 10) were the least reliably 
judged of any in Table 4. The remaining 


items were judged consistently in some 
comparisons but not in others. 
Of particular interest is the extent of 


agreement between the self-judgments on the 
one hand and those by the Os and by other 
group members on the other. Self-judgments 
are usually not utilized in analytic situations 
of this type, but apparently they agree well 
with the ratings made by others in regard to 
certain aspects of behavior such as contribution 
to the group decision (No. 1), clerical work 
(No. 2), leadership (No. 3). This statement 
seems to hold for both task situations used 
in the present study. 

In order to provide a comparative analysis 
of the value of items from both a between- 
session and a within-session point of view a 


summary of the significance levels for the 
different comparisons on an_ item-by-item 
basis is presented in Table 5. The items are 
grouped into categories on the basis of their 
levels of significance. Category I contains all 
items significant beyond the 5 per cent level. 
Category II includes items significant between 
the 5 per cent and the 20 per cent levels. 
Category III contains the items that fail to 
achieve the 20 per cent level of significance. 
The items in the first category reach an 
acceptable standard for rejecting the null 
hypothesis. The second category contains 
items which, while not demonstrated to be 
significant within the limited sampling of the 
present study, merit further consideration 
because of their possible value for scaling 
behavior of the type under analysis. Items for 
which there is no demonstrated significance 
from one interaction period to another are 
included in the third category. 

An inspection of this table shows, for 
example, that the item dealing with contribu- 
tion to group decision (No. 1) produced reliable 
judgments on a within-session basis and that 
the judgments for this item are also reliably 
assessed on a member-member basis from a 
between-session standpoint. The same kind of 
statement can be made for the leadership 
(No. 3) and dominant (No. 6) items. On the 
other hand, items such as good job (No. 7) and 
satisfied with group decision (No. 9) do not 
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seem to be judged consistently on a within- 
session basis, nor do they produce consistent 
judgments from a between-session standpoint. 
A more detailed examination of this table 
will identify the items likely to be useful in 
the evaluation of behavior samples on a 
short-term interaction basis for the types of 
situations used in this study. 


SUMMARY AND CONCLUSIONS 


This paper dealt with the reliability of 
social judgments based on direct assessment 
of Ss engaged in short-term interaction. 
In order to assess the reliability of these kinds 
of judgments an experimental design was 
devised in which the conditions in the two 
administrations were different. 

The general plan called for the direct 
assessment and observer evaluation of the 
social and personal characteristics of the same 
33 Ss in two completely independent inter- 
action situations. Changes between the two 
group situations included the task on which 
the members worked, the composition of the 
groups, and the identity of the Os. On comple- 
tion of each task, group members were asked 
to fill ina 12-item, direct-assessment question- 
naire on which they judged their own behavior 
and that of the other two group members on a 
ranking basis. The Os judged the behavior 
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of the group members using the same question- 
naire. The results confirm the proposition 
that reliable judgments of short-term inter- 
action can be made even when the two 
situations are different. Items that permit 
consistent assessment from both the between- 
and within-sessions points of view include 
leadership (No. 3), contribution to group 
decision (No. 1), and dominance (No. 6). Also 
of interest is the finding that Ss’ judgments of 
their own behavior, a type of data frequently 
disregarded, are related significantly to the 
judgments made by Os and other group 
members and therefore merit further investiga- 
tion for possible utility. These results support 
the view that direct-assessment data can serve 
as one basis for evaluating personal and social 
characteristics that cannot be satisfactorily 
measured by other techniques. 
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CASE REPORT 
A BLIND ANALYSIS OF A CASE OF MULTIPLE PERSONALITY USING 
THE SEMANTIC DIFFERENTIAL 


CHARLES E. OSGOOD anp ZELLA LURIA 
University of Illinois 


OR MORE than a year the writers have 
been collecting semantic data from pa- 
tients undergoing psychotherapy. This 
has been feasible through the cooperation of 
psychotherapists in various parts of the 
country. When the manuscript of “A Case of 
Multiple Personality” (3) arrived, the editor 
of this journal, without our knowledge, sug- 
gested to Thigpen and Cleckley that it would 
be interesting to have semantic data from 
each of the “personalities” in their patient. 
Thigpen and Cleckley accepted the suggestion 
and administered a form of the differential we 
have been using for this purpose to their pa- 
tient in each of her three {personalities. The 
editor also suggested that we might see how 
much these semantic data would allow us to 
infer about the patient without our having 
any knowledge of the case history, protocol, 
or prognosis.' This appeared to be a useful 
and rather intriguing way to estimate the 
validity and sensitivity of this instrument. 
If we could infer descriptions of the three 
personalities which correspond with clinical 
observations, and if we could make reasonably 
accurate interpretations and predictions, we 
would be encouraged to continue our efforts 
to improve the semantic differential as a 
clinical tool. 
THE SEMANTIC DIFFERENTIAL 
The semantic differential is a combination 
of association and scaling procedures de- 
signed to give an objective measure of the 
connotative meaning of concepts. The under- 
lying logic (1) can be summarized as follows: 
(a) The process of description or judgment 
can be conceived as the allocation of a con- 


1 We wish to thank Dr. J. McV. Hunt, editor of this 
journal, for utilizing the opportunity presented by the 
work of Thigpen and Cleckley to arrange for the data 
in this study, for suggesting the blind analysis, and for 
his general encouragement. We are also very grateful 
to Drs. Thigpen and Cleckley for giving the semantic 
differential to their patient and for their interest in the 
instrument. 


cept to a set of experiential continua defined 
by pairs of polar terms. Thus the connotative 
meaning of a linguistically complex assertion, 
such as “My father has always been a rather 
submissive person,” can be at least partially 
represented as 


_passive 


MY FATHER active__:__:__:__: X:_: 
: hard 


MY FATHER ss soft__: X:__:_: 


The greater the strength of association, e.g., 
“extremely submissive, a regular door- 
mat,” the more polarized, toward 1 or 7, the 
allocation (2). Since many scales of judgment 
are highly intercorrelated (e.g., good-bad, fair- 
unfair, honest-dishonest, kind-cruel, and so 
forth all reflect mainly the single “evaluative” 
factor in judgments), a limited number of 
such continua can be used to define a semantic 
space within which the connotative meaning 
of any concept can be specified. This clearly 
indicates some variant of factor analysis as 
the basic methodology in developing such an 
instrument. Two such analyses have been 
completed, both providing evidence for three 
general factors, “evaluation,” “potency,” and 
“activity,” and some unknown number of 
specific factors that are probably denotative 
in nature. 

The form of semantic differential we have 
been using in studying psychotherapy is 
based on this factor analytic work. In the 10 
scales used, it gives approximately equal 
weight to the first three factors isolated. These 
scales and their factor loadings are given in 
Table 1. The 15 concepts used in this form of 
the differential were selected after consulta- 
tion with clinicians and pretesting for their 
differentiating power. Ideally, they should 
sample the major persons and problems in- 
volved in therapy-in-general; we are not 
entirely satisfied with the present set, how- 
ever, and more work should be done here. The 
concepts used are also shown in Table 1. In 
the test form itself, concepts are rotated 
against scales in such a way that each con- 
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cept appears once with each scale, but with a 
maximum interval between successive ap- 
pearances of both. The subject is instructed 
to do his checking rapidly, without struggling 
over particular items, to give his “immediate 
impressions.” A 150-item form such as this 
usually takes less than 10 minutes to com- 
plete. 

Reordering the raw data for a single sub- 
ject on a single testing yields a matrix of NV 
columns (here, 15 concepts) and 7 rows (here, 
10 scales). The meaning of a particular con- 
cept to the subject, as defined by the opera- 
tions of measurement here, is the profile of 
numbers in its column (or, more efficiently, 
the position in the n-dimensional space de- 
fined by the projection of these numbers onto 
the factors). Difference in meaning for two 
concepts is defined by the distance between 
their positions in this space, as computed by 
the generalized distance formula, D = V >a, 
in which d is the difference in allocation of the 
two concepts on a single scale (2). The more 
similar any two concepts are in connotative 
meaning, the smaller will be the value of D. 
Change in meaning (of the same concept at 
different times during therapy, or in different 


TABLE 1 
CONCEPTS AND SCALES UsEep tn Tuts ANALYSIS 


Concepts 


LOVE MENTAL SELF-CON- 
CHILD SICKNESS TROL 
MY DOCTOR MY MOTHER HATRED 
ME PEACE OF MY FATHER 
MY JOB MIND CONFUSION 
FRAUD SEX 
MY SPOUSE 
Scales and Their Factor Loadings 
Scales Evaluation Activity Potency 
valuable-worthless .79 13 .04 
clean-dirty .82 .03 — .05 
tasty-distasteful 77 —.1l 05 
fast-slow 01 .70 00 
active-passive 14 59 .04 
hot-cold — .04 46 — .06 
large-small 06 34 -62 
strong-weak 19 .20 62 
deep-shallow 27 14 46 
tense-relaxed 55 37 —.12 
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““personalities”) can be defined by the same 
operation, except that d here refers to the 
differences in allocation of the same concept 
on the same scale at different testings. The 
mathematical properties of this formula also 
allow us to represent the semantic structure of 
an individual in a concise form; computation 
of the distance, D, of every concept from 
every other concept yields an V/N matrix 
(here, 15/15) of distances which have the 
property of plotting within a space having 
dimensionality equal to the number of factors. 
To the extent that the individual subject 
being studied uses the same three factors 
isolated in our general factor work, his data 
will plot accurately in three dimensions. 


THe SEMANTIC DATA 


At this point we should state exactly what 
information we have about this case. We know 
that we are dealing with a case of triple 
personality, and these have been labeled for 
us (presumably by the therapists who col- 
lected the semantic data) “Eve White,” “Eve 
Black,” and “Jane.” We suppose that the 
“White” and “Black” have some connotative 
significance—certainly, as will be seen, the 
quantitative semantic data distinguish sharply 
between them. We also know, of course, that 
the patient is a woman, presumably partici- 
pating in some kind of therapy; we do not 
know the stage of therapy or whether or not 
she is hospitalized. We considered it also fair 
to ask (from J. McV. Hunt) about the follow- 
ing items of sociological status, because they 
contribute to the meaningful interpretation of 
certain concepts: Concept cHILD—does this 
woman have a child? Yes, she does. Concept 
SPOUSE— is this woman married? Yes, she is. 
Concepts FATHER and MOTHER—are her 
parents alive? The mother is, but Hunt 
doesn’t know about the father. Concept my 
jop—has this woman had a job outside of 
homekeeping? Yes, she has. This is the sum 
total of our external information about the 
case. 

The semantic differential was given to this 
woman twice while “in” each of her three 
personalities; a period of about 2 months 
intervened between the two testings. The 
raw semantic data for each of the three per- 
sonalities are given in Tables 2-4. The roman 
numerals I and II refer to first and second 
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testings respectively.2 We take the space to 
give these raw data in full so that anyone who 
is interested may study them and test any 
particular “hypotheses” about the case he 
wishes. Low values in these tables indicate 
judgments toward the polar term on the left 
and high values judgments toward the polar 
term on the right. The data in these tables 
show the semantic profiles, or meanings, of 
each concept at each testing and while “in” 
each personality. Since the form given at each 
testing was actually a double form (each item 
repeated once), we were able to estimate the 
reliability of these data. The immediate test- 
retest reliability coefficients for each of the 
testings are as follows: Eve White I, .82; Eve 
White II, .90; Eve Black I, .65; Eve Black 
II, .89; Jane I, .89; Jane II, .94. These coeffi- 
cients indicate (a) a generally satisfactory 
level of reliability, (6) a consistent trend in 
all three personalities toward greater stability 
through time, and (c) that Jane is the most 
consistent or stable personality over short in- 
tervals of time and Eve Black is the least. 

To obtain measures of semantic similarity 
and structure, we computed the matrices of D 
for each concept with every other concept, for 
each personality and testing. With an ordi- 
nary desk calculator and a table of square 
roots, these operations are very simple and 
rapid. In order to conserve space, the six 
matrices of D are not given here. These “‘dis- 
tances” are based on application of the 
formula given earlier across all 10 scales. For 
convenience in plotting the models which 
appear as Fig. 1-6, the data for scales con- 
tributing to each of the three factors were 
averaged and new D’s computed. This, in 
effect, forces those data into three dimen- 
sions and, hence, into solid models that have 
no error. The very slight amount of distortion, 
or loss of information, resulting from this 
averaging process and restriction to three 
dimensions can be seen from the following 
correlations between original (10 scale) and 
“factor” D matrices (3 average scales): Eve 
White I, .91; Eve White II, .93; Eve Black I, 
.96; Eve Black II, .98; Jane I, .86; Jane II, 
.92. In other words, nearly all of the variance 


? These values are actually averages of two check- 
marks, a double form of the semantic differential having 
been used for reliability purposes. In this double form, 
each item appears twice. 
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in this woman’s judgments can be accounted 
for in terms of three factors. Figures 1-6, then, 
provide quite accurate representations of the 
ways various concepts are related in each of 
the personalities; the smaller the distance 
between any two concepts the more similar in 
connotative meaning they are. 


THE THREE PERSONALITIES AND THEIR 
CHANGES THROUGH TIME 

The general assumption we are following is 
that ‘mental illness’ is essentially a dis- 
ordering of meanings or ways of perceiving 
from those characteristic of people judged 
“normal” in our society, and that the process 
of psychotherapy from the patient’s point of 
view is essentially a reordering and changing 
of these meanings. Within the limitations of 
our type of measurement and our sampling of 
concepts, the locations and relations among 
concepts shown in Fig. 1-6 can be thought of 
as pictures of how this woman perceives her- 
self, the significant people about her, and 
certain modes of when functioning 
“in” her several personalities.? We assume 
that this woman is receiving some kind of 
treatment through the period covered by our 
two samplings, I and II, and therefore look 
particularly for the types of changes in mean- 
ing that are taking place in the three per- 
sonalities, as well as at the general nature of 
their organization. For purposes of ready 
comparison, all of the models are oriented in 
respect to the concept My pDocToR, which 
stays practically constant in meaning (good, 
strong, and guile active) through both time and 
personalities; spatially, in the figures, good is 
up and bad down, active to the left and passive 
to the right, and strong is away from the 
viewer while weak is near to or toward the 
viewer; the solid ball represents the origin 
of the space, e.g., a hypothetical ‘‘meaning- 
less” concept that would result from checking 
all 4’s. 
Eve White 

Semantic structures for Eve White I and 
II are shown in Fig. 1 and 2. The most general 
characterization would be that Eve White 
perceives “the world” in an essentially normal 


action 


fashion, is well socialized, but has an unsatis- 


5 The authors wish to thank Professor Jozef Cohen 
for his help in preparing these figures. 
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LOVE 














Fic. 1. Eve Warre I 





factory attitude toward herself. Here the usual 
societal “goods” are seen favorably—my 
DOCTOR, MY FATHER, LOVE, SELF-CONTROL, 
PEACE OF MIND, and MY MOTHER are all good 
and strong whereas FRAUD, HATRED, and to 
some extent CONFUSION are bad. The chief 
evidence of disturbance in the personality is 
the fact that me (the self concept) is con- 
sidered a little bad, a little passive, and defi- 
nitely weak. Substantiating evidence is the 
weakness of her CHILD, as she sees him (or 
her), and the essential meaninglessness to her 
of my spouse and sEx. Note also the wide 
evaluative separation between LOVE and SEx. 
In the interval between testings I and II mE 
and sEx become more bad and passive and 
simultaneously become almost identical in 
meaning to her—and note that her concep- 
tions of LOvE (a good, strong thing) and sEx 
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HATRED 





FRAUD 


Fic. 2. Eve Waite II 


(a bad, weak thing like herself) have moved 
still further apart. 


Eve Black 


Semantic structures for Eve Black I and II 
are shown in Fig. 3 and 4. The most general 
characterization here would be that Eve Black 
has achieved a violent kind of adjustment in 
which she perceives herself as literally perfect, 
but, to accomplish this break, her way of per- 
ceiving “the world” becomes completely dis- 
oriented from the norm. The only exceptions to 
this dictum are MY DOCTOR and PEACE OF 
MIND, which maintain their good and strong 
characteristics, the latter, interestingly 
enough, also becoming active on II. But if 
Eve Black perceives herself as being good, 
then she also has to accept HATRED and 
FRAUD as positive values, since (we assume) 
she has strong hatreds and is socially fraudu- 
lent. So we find a tight, but very un-normal, 
favorable cluster of ME, MY DOCTOR, PEACE 
OF MIND, HATRED, and FRAUD. What are posi- 
tive values for most people—cHILD, MY 
SPOUSE, MY JOB, LOVE, and SEX—are com- 
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Fic. 3. Eve Brack I 


pletely rejected as bad and passive, and all of 
these except CHILD are also weak (this may be 
because CHILD was weak in Eve White and 
much of the change here is a simple “flip-flop” 
of meanings). Note that it is MOTHER in this 
personality that becomes relatively meaning- 
less; FATHER, on the other hand, stays good 
but shifts completely from strong (in Eve 
White) to weak—possible implications of 
these familial identifications will be con- 
sidered later. Note also that in this per- 
sonality LOVE and sex are closely identified, 
both as bad, weak, passive things. 


Jane 


Semantic structures for Jane I and II are 
shown in Fig. 5 and 6. The general charac- 
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terization is that Jane displays the most 
“healthy” meaning pattern, in which she ac- 
cepts the usual evaluations of concepts by her 
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society yet still maintains a satisfactory evalua- 
tion of herself. MY FATHER, MY MOTHER, MY 
CHILD, and My pocror—most of the signifi- 
cant persons in her life—are seen as good, 
strong, and active. The major modes of be- 
havior, PEACE OF MIND, LOVE, SELF-CONTROL, 
and MY JOB, are seen as equally good and 
strong, but somewhat passive—as if these ways 
of behaving and thinking were simply ac- 
cepted without stress. The two socially 
agreed-upon evils, HATRED and FRAUD, are 
put in their proper places. The most signifi- 
cant characteristics of Jane’s meaning system, 
however, are these: The self concept, ME, 
while still not strong (but not weak, either) is 
nearer the good and active directions of the 
semantic space; note also the close identifica- 
tion of ME and MENTAL SICKNESS, which here 
is mot an unfavorable concept to her. Her 
attitude toward her husband, My SPouSsE, is 
for the first time meaningful (unlike Eve 
White) and tending toward the good, strong, 
active directions, like the other significant 
persons (unlike Eve Black). And Love and 
SEX (quite unlike Eve White) are both favor- 
able and quite closely identified. The changes 
from testings I to II are simply such as to 
strengthen the “healthy” pattern evident in 
the first view. ME becomes considerably more 
good and active; MY SPOUSE for the first time 
becomes completely identified connotatively 
with MY DOCTOR and MY FATHER (and loses its 
tie with CONFUSION); and LOVE and SEX be- 
come intimately identified with each other 
and close in meaning to SELF-CONTROL and 
PEACE OF MIND. 

The thumbnail semantic sketches of each 
personality just given make it intuitively 
evident that the semantic differential does 
draw sharp distinctions between the three 
personalities inhabiting one nervous system. 
It is possible to demonstrate these distinctions 
quantitatively by intercorrelating D matrices 
between personalities and over time. If two of 
our models are generally similar in structure, 
such that large and small distances between 
concepts in one are reflected also in the other, 
then the 7 will be high. Table 5 gives these 
correlations. The first thing to note is that 
the correlation of each personality with itself 
(e.g., testings I and II) is regularly much 
higher than the correlation of that personality 
with any other personality (with the single 
exception of Eve White I and Jane I). This 


TABLE 5 


CORRELATIONS OF D-MATRICES BETWEEN PERSONALI- 
TIES AND OVER TIME 








White| White Black | Black | Jane | Jane 
Ii/;mirinig:isn 


White I _ 

White II 13) — 

BlakI —.06' — — 

Black II — |—.02 86), — 

Jane I 13) — |\—.6); — — 

Jane II _— 53 — |\—.08 | .92) — 


is quantitative justification for the statement 
that the semantic differential does differ- 
entiate between the several personalities of 
this woman. Whether it differentiates in a 
valid way is a matter that can be judged only 
by relating our analysis to the detailed case 
history material available elsewhere (3). 

Another important thing to note about 
these correlations is that Eve White and Jane 
(the two “socialized” personalities) are fairly 
highly correlated whereas the correlations of 
Eve Black with the cther two are definitely 
low, even negative. In other words, Eve 
Black is clearly the most deviant and dis- 
ordered personality. Finally, it should be 
noted that these three personalities differ 
somewhat in their stability, as indexed by the 
I/II correlations, Eve White being the least 
stable and Jane the most. 


CHANGES IN MEANING OF SPECIFIC CONCEPTS 


As noted earlier, the meaning of a specific 
concept is operationally defined as its profile 
against the differential (e.g., its position in 
semantic space). Change in meaning between 
testings I and II can be measured directly by 
the D between I and II profiles for the same 
concept (e.g., between the paired columns in 
Tables 2-4). These D values are given under 
“Within Personalities, Between Testings” in 
Table 6. Changes in meaning between per- 
sonalities for the same concepts can be meas- 
ured directly by the D between profiles for 
the same concept but as judged in two differ- 
ent personalities; these D values are given 
under “Between Personalities, Within Test- 
ings” in Table 6. 


Semantic Stability Through Time 


In general, although the differences are not 
great, Eve Black is the least stable personality 
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TABLE 6 
CHANGES IN MEANING OF SPECIFIC CONCEPTS 


| Within Person- 


alities, Between Between Personalities, 


Within Testings 


Testings 
Concept © 
' | = oa ae Dy-s Dw-; Ds-; 

PPPS | eee 

eS Oe FOo™ 

| g & aoe Se oe Se ae 
LOVE } 42 | .67  .23 |1.581.44).35) .$71.62)1.81 
CHILD 71 | .96 | .37 |1.65}1.40).68) .54)1.47/1.41 
MY DOCTOR 15 | .05 | .27 | .23] .23|.28) .25) .30) .12 
ME -36 | .32 | .42 |1.21/1.40].60) .88) .83 .77 
MY JOB 27 | .62 | .42 |1.30/1.43).49) .43/1.19)1.54 
MENTAL SICK- | .45 | .45 | .19 |1.24/1.38).40) .32:1.30/1.47 
NESS 
MY MOTHER .54 | .78 | .46) .71) .78).66) .23/1.02) .68 
PEACE OF MIND | .40 | .35 | .41 | .86) .66).21) .28) .81! .61 
FRAUD 64 | .12 | .40 1.46)1.35).73) .34/1.29 1.22 
MY SPOUSE 30 | .62 | .47 | .67| .96).64, .89/1.04/1.75 
SELF-CONTROL .32 25 | .24 | .78) .80).34) .39) .921.01 
HATRED 51 19 | .44 (1.54/1.31/.51) .23)1.371.19 
MY FATHER 53 | .71 | .09 |1.06) .60/.25) .43'1.06, .43 
CONFUSION 64 | .67 | .44 .86) .96).71) .42) .98 .88 
SEX 62 | .34| .47 (1.10) .62).63 1.20)1.45/1.76 


Note.—Numbers in italics indicate concepts that serve best to 
characterize differences between Eve White and Jane. 


through time and Jane the most stable (cf., 
columns 2-4 in Table 6). The concept-by- 
concept data thus confirm the stability of 
Jane as shown in the structural data given 
above. For Eve White the most unstable or 
labile concepts are CHILD, MY MOTHER, FRAUD, 
HATRED, MY FATHER, CONFUSION, and SEx. For 
Eve Black the most labile concepts are LOVE, 
CHILD, MY JOB, MY MOTHER, MY SPOUSE, MY 
FATHER, and CONFUSION. For Jane the most 
unstable notions are ME, MY JOB, MY MOTHER, 
MY SPOUSE, HATRED, CONFUSION, and SEx. We 
note that the family constellation—mother, 
father, spouse, child—tend to be more sus- 
ceptible to change through time within these 
personalities, but that the self concept stays 
relatively constant within personalities (e.g., 
the location of the self concept, in a sense, 
defines these three personalities). HATRED, 
SEX, and CONFUSION also seem to be points of 
stress. 


Semantic Stability Between Personalities 


The data given in columns 5-10 in Table 6 
make it clear that concept meanings in general 
shift more between Eve Black and the other 
two than between Jane and Eve White, this 
again substantiating the over-all correlations 
between total structures. The only concept 
that. remains strictly constant in meaning 
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through the personality changes in this woman 
is MY DOCTOR, although PEACE OF MIND and 
CONFUSION show greater stability than most 
others. It is interesting to note which con- 
cepts serve best to characterize the differences 
between Eve White and Jane (Eve Black 
shows gross differences on almost all con- 
cepts). These two “socialized” personalities 
differ from one another chiefly on ME, My 
SPOUSE, and sEx, and these differences are 
increasing in magnitude through time. This 
clearly suggests this woman’s sexual life as a 
core problem, Eve White being highly critical 
of all three concepts and Jane accepting them 
as positive values. It is also interesting to note 
in this connection that semantic differences 
between Eve White and Jane on CHILD, My 
MOTHER, FRAUD, HATRED, and CONFUSION are 
decreasing through time. 


INTERPRETATIONS OF THESE SEMANTIC DATA 


The analyses of these personalities and their 
changes given so far have been descriptive 
rather than interpretive for the most part. In 
a sense, we have merely put into words what 
this woman herself, in her several personali- 
ties, has indicated by her check marks. The 
treatment of semantic differential data, from 
the patterns of check marks to construction 
of the models shown in Fig. 1-6, is completely 
objective, and any investigator starting from 
the same data and following the rules would 
have to end up with the same pictures we 
have. 

Making interpretations and predictions 
about this case on a “blind” basis is another 
matter entirely. In this section we go far 
beyond the objective data, and we are conse- 
quently much less confident about our state- 
ments. For one thing, neither of the writers is 
an experienced clinician—certainly not ex- 
perienced with respect to the dynamics and 
characteristics of multiple personality. For 
another thing, we do not know at what stage 
in therapy our two testings were made, and 
interpretation would certainly vary greatly in 
terms of such information. It should also be 
pointed out that in the ordinary use of the 
semantic differential as a clinical tool (as 
compared with a blind analysis) many other 
sources of information would be available to 
support certain alternative interpretations 
and render others farcical. Let it be under- 
stood, then, that what follows is a flight into 
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conjecture, in contrast with the preceding, 
factual reporting of semantic data. 


Interpretive Descriptions of the Three Per- 
sonalities 


Eve White is the woman who is simultane- 
ously most in contact with social reality and 
under the greatest emotional stress. She is 
aware of both the demands of society and her 
own inadequacies in meeting them. She sees 
herself as a passive weakling and is also con- 
sciously aware of the discord in her sexual 
life, drawing increasingly sharp distinctions 
between LOVE as an idealized notion and sEXx 
as a crude reality. She maintains the greatest 
diversity among the meanings of various 
concepts. She is concerned and ambivalent 
about her CHILD, but apparently is mot aware 
of her own ambivalent attitudes toward her 
MOTHER—and seems to become more resistant 
to this by testing II. Those psychoanalytically 
inclined may wish to identify Eve White with 
dominance of the superego: certainly, the 
superego seems to view the world through the 
eyes of Eve White, accepting the mores or 
values of others (particularly her mother) but 
continuously criticizing and punishing her- 
self. If this case came to the psychotherapists 
with a voluntary, self-initiated plea for help, 
then it seems likely that Eve White was 
dominant at the time. 

Eve Black is clearly the most out of contact 
with social reality and simultaneously the 
most self-assured. To rhapsodize, Eve Black 
finds PEACE OF MIND through close identifica- 
tion with a God-like therapist (My DocToR, 
probably a father symbol for her), accepting 
her HATRED and FRAUD as perfectly legitimate 
aspects of the God-like role. Natura!ly, she 
sees herself as a dominant, active wonder- 
woman and is in no way self-critical. She is 
probably unaware of her family situation. 
Those psychoanalytically inclined could say 
that the id looks out at the world through the 
eyes of Eve Black. Like a completely selfish 
infant, this personality is entirely oriented 
around the assumption of its own perfection. 
Actually, Eve Black seems to be more har- 
monious with the Adlerian than with the 
Freudian model, since personal perfection is 
apparently the demand acceded to rather 
than sexuality. If the case was committed to 
an institution, it seems likely that this per- 
sonality was the reason for commitment. 


Jane is the most puzzling of the three per- 
sonalities, and our interpretation will have to 
depend upon assumptions about the stage of 
treatment (see below). Superficially, Jane is 
a very healthy personality: “‘all’s well with the 
world, and day by day I’m getting better and 
better.” Thus we find all the people in her 
life perceived as good and strong and active 
and all the socially approved modes of action 
perceived as good and strong and passive; 
SEX is LOVE-ly, her SPOUSE is becoming more 
like the noble pocror all the time, and she is 
coming to perceive herself even, as a pleasant 
and reasonably active (if somewhat weak and 
submissive) person. But all this is a little too 
rosy, a little too pat. We note that Jane is 
becoming more and more “simple-minded”— 
all of her judgments tending to fall along a 
single factor of good-strong vs. bad-weak— 
which makes the Jane II model the most 
restricted and undiversified of all. Those 
psychoanalytically inclined may wish to view 
this personality as representing dominance of 
a self-deceptive ego which has woven a web of 
repression as to the state of reality; or, they 
may wish to view Jane as an essentially 
strong, healthy, and improving ego-dominated 
personality. In any case, we doubt if Jane 
would have either come for therapy or have 
been institutionalized—as such. 


Possible Dynamisms Operating in the Case 


Identification mechanisms. We say the pa- 
tient “identifies” with some other person when 
her meaning of herself, mE, is semantically 
close to her meaning of the other person; e.g., 
if she sees her father as a kind, active, relaxed, 
etc. person and describes herself in the same 
terms, we infer identification. However, the 
pattern of identifications displayed by this 
patient seems unusual. Only in Eve Black, the 
obviously disoriented personality in terms of 
her values, is there clear differential identifi- 
cation—with her FATHER (and this may re- 
flect the semantic tie-up between FATHER and 
MY DOCTOR). Jane shows some slight tend- 
ency toward closer identification with MOTHER, 
but it is not close. Eve White shows none 
with either parent. The fact that identifica- 
tion with FATHER (and My DOCTOR) in Eve 
Black is accompanied by rejection of MOTHER 
to meaninglessness is suggestive of an under- 
lying conflict in identifications. Note also, in 
this connection, that in Eve Black I the 
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ascendancy of ME to the good, strong, active 
position is accompanied by making FATHER 
weak—as if she were taking over her father’s 
role and putting her mother in her own previ- 
ous place. And, interestingly enough, the 
concept SELF-CONTROL suffers the same fate 
aS MOTHER. This picture of Eve Black is 
certainly suggestive of an Electra complex as 
the underlying dynamism. In “real” life, her 
MOTHER is or was the dominant, threatening 
figure—moralizing, demanding standards and 
SELF-CONTROL—and in Eve Black this woman 
escapes the pressure by rendering both 
MOTHER and SELF-CONTROL meaningless and 
simultaneously identifying with and taking 
her FATHER’S place (via the therapist). Sug- 
gestive evidence may be found in Tables 2-4: 
MOTHER is consistently colder than FATHER and 
usually more fense and fast (e.g., Factor 3). 
Identification of the self with the therapist 
in Eve Black is perfect, of course. The con- 
cept MY DOCTOR is the only personal concept 
to show perfect stability both between per- 
sonalities and through time. The patient thus 
displays what might be called maximal positive 
transference in all three personalities; there is 
no sign of any negative transference at either 
testing, which may be indicative of the stage 
of therapy (e.g., early). 

Significance of the patient’s sexual life. 
Although Jane shows a rosy acceptance of 
normal sexual patterning—with LOVE and 
sEx linked and passively favorable, Eve 
White clearly displays awareness of a basic 
conflict in this area—sEx is early somewhat 
more distasteful than LOVE and becomes 
distinctly distasteful and dirty by testing II. 
In Eve White also we find ME and My SPOUSE 
becoming linked with sex in this unpleasant 
location. Eve Black, on the other hand, re- 
jects both sex and LovE—but closely links 
them in her thinking. If we were to relate 
these facts with the Electra situation de- 
scribed above, the interpretation would be 
that her persisting conflict with her mother 
and attempts to identify with her father make 
it impossible for her to experience normal 
sexual satisfactions with her husband and to 
carry out the normal mother-wife-home role. 
Eve White is aware of this, in a sense, but 
Jane clearly is not. The concept My JoB is 
interesting in this connection: its persistent 
linkages with LOVE, PEACE OF MIND, and 
SELF-CONTROL in the two “socialized” per- 
sonalities, coupled with its linkage with sEx 
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and MY SPOUSE in Eve Black, clearly suggests 
to us that this woman is interpreting My JoB 
in the sense of ‘my job as a mother, wife, and 
homebuilder” rather than in terms of her 
outside work (which we understand she has). 
In any case, there is clear evidence of involve- 
ment with her sex life as a major problem, 
and this may have been the presenting prob- 
lem when she began therapy. 

Repression and amnesia. Knowing that we 
are dealing with a case of multiple person- 
ality—usually characterized by complete 
dissociation between states—it is interesting 
to speculate on what meanings are repressed 
in the several personalities. It will be recalled 
that, operationally, meaninglessness of a con- 
cept is defined by its closeness to the origin 
(the solid balls in the models). This is prob- 
ably to be interpreted as ‘‘connotative dead- 
ness” or “damping of affect” with respect to 
the concept involved. Within the matrix of 
our instrument, however, there is another 
way in which repression or amnesia may 
show up, and that is via a complete shift in 
meaning of the concept being judged (e.g., 
CHILD may shift from the personal reference 
of “my own youngster” to “children-in- 
general’’). Looking back at the semantic data 
with these points in mind, we hazard the 
following guesses: Eve White probably has 
the best contact with reality and may not be 
amnesic at all (except for the other person- 
alities); Eve Black may be amnesic for her 
mother and her own role as a mother and 
wife; Jane in Pollyanna fashion may be 
amnesic for her own problems, e.g., MENTAL 
SICKNESS and CONFUSION, and the _indis- 
criminate way in which she lumps all socially 
favorable concepts at least suggests that she 
is judging CHILD, MOTHER, etc. in the abstract 
rather than as MY CHILD, MY MOTHER, and so 
forth. 


Interpretation I—Assumed Early Stage of 
Therapy 


The “original” personality, in the sense of 
being most characteristic of the woman her 
friends and relations knew, was Jane. The first 
testing of this personality shows a relatively 
weak ME that is associated with MENTAL 
SICKNESS; i.e., she was dimly aware of her 
own inadequacies but was striving to main- 
tain a rigid acceptance of the real world and 
maintain an adequate home life. The people 
about her, with the exception of her husband, 
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were seen as strong and active (perhaps 
threatening) in relation to herself, and her 
love life was regarded as a sort of deliberate, 
controlled duty. She was completely un- 
aware of her (repressed) emotional ambiva- 
lence toward her mother, husband, and child. 
The things being hidden in this personality, 
and providing the force behind the eventual 
split, were (a) her Electra complex, (6) her 
repugnance for sexual relations with her 
husband, and (c) her ambivalent attitude 
toward herself. We suspect that she had a 
position in society that demanded “good 
front.” 

We must assume strong and about equal 
pressures toward solving the Electra complex, 
(a) by identifying with FATHER and asserting 
the self (id?), and (6) toward solving it by 
identifying with MOTHER and devaluing her- 
self (superego?). This produces a two-way 
split away from the Jane pattern, one into 
Eve Black where selfish needs for superiority 
and playing the father role are achieved and 
another into Eve White where societal needs 
for submission and playing the mother role 
are achieved. This split, and the subsequent 
availability of the other roles, allows Jane to 
shift toward the “sweetness and light” view 
of the world, and this is clearly demonstrated 
by the changes between Jane I and Jane II. 
Eve White continues to become more simply 
and rigidly self-critical and Eve Black con- 
tinues to become more simply and rigidly 
self-satisfied. 

Assuming successful therapy is possible— 
which seems questionable—it will involve less 
and less time being spent in being Jane and 
Eve Black and a consequent shift into Eve 
White, where better contact and differentia- 
tion seems to be maintained. But here it will 
be necessary to bring Eve White to under- 
stand the reason for her depression, the role 
of her ambivalence toward her mother in her 
problem—which shows no signs of happening 
yet—and thence a gradual restructuring in 
which ME becomes more favorable, along with 
SPOUSE and sEx, and identifications with 
FATHER and MOTHER are reassigned. This will 
probably involve a period of negative trans- 
ference, with My pocToR and MOTHER becom- 
ing closely identified and being temporarily 
shifted to bad, strong, and active directions of 
the semantic space. In other words, successful 
treatment will mean increasing time spent in, 


and a gradual restructuring of, Eve White 
to the point where it incorporates what is now 
Jane, but with a realignment of significant 
persons. On the other hand, if this woman is 
in a mental institution and remains there, it 
seems likely that Eve Black will become the 
dominant house she lives in. In either case, it 
is probable that Jane will appear less and less. 


Interpretation II—Assumed Late Stage of 
Therapy 


If we assume that we are seeing the ter- 
minal stages of therapy with a case of this 
sort, then a quite different interpretation is 
necessary. The difference in interpretations 
hinges upon Jane, either as a deceptive and 
vanishing original personality (interpretation 
I) or as an increasingly healthy and aug- 
menting personality (interpretation II). In 
the latter case we would assume that Eve 
White had been the “original” personality as 
people knew her—a socially acceptable wife 
and mother, but one laden with conflicts, 
anxieties, and self-criticism. The split in this 
case—into one personality in which the self- 
criticism completely disappears via _irra- 
tionality (Eve Black) and another in which 
self-criticism vanishes via rationality (Jane)— 
seems less sensible, however. Jane seems un- 
necessary at this stage and really should have 
developed out of Eve White rather than being 
contemporaneous. If we assume there was a 
split in any case, Jane is clearly the most 
healthy personality, since LOvE and SEX are 
identified, the world is viewed in acceptable 
fashion, and the self concept is becoming 
more favorable all the time. The prediction 
here would be increasing time spent in Jane 
and less in the others. The stumbling blocks 
in the way of this interpretation are (a) the 
lack of any realignment of the system of 
parental identifications, and (5) the fact that 
Jane is becoming Jess diversified semantically 
(more “simple-minded’”’) rather than the 
reverse. This second interpretation was ac- 
tually the one we first adopted—because of 
the superficial “healthiness” of Jane—but 
consideration of all the evidence seems to 
favor the first interpretation. However, it 
should be noted that if this case is near the 
end of successful therapy, Jane is the only 
personality that combines both a normal view 
of the world and reasonable (increasing) ac- 
ceptance of the self. 
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Interpretation I1I—Combination of I and II 


It is possible to combine interpretations I 
and II by assuming that Jane is both the 
original personality which broke apart and 
the terminal personality which is being de- 
veloped out of therapy. In this case, the early 
development of the case, probably in child- 
hood, would be the same as that given under 
interpretation I—the conflicting parental 
identifications (id and superego determined 
respectively) were of about equal strength and 
finally became too intense to be contained 
within the self-deceptive personality organiza- 
tion of Jane. During the middle course of the 
case, when therapy was undertaken, we thus 
find all three personalities oscillating, tem- 
porary dominance of the mother and wife 
role being represented by Eve White, tem- 
porary dominance of the self-gratifying father 
role being represented by Eve Black, and 
temporary dominance of the face-saving, 
problem-solving ego being represented by 
Jane. Intentionally or unintentionally, the 
effect of therapy may be to strengthen the 
self-deceptive organization of Jane without 
resolving the underlying conflicts dramatized 
by Eve White and Eve Black. The over- 
simplified, Pollyanna-like ways of perceiving 
herself as good along with all the other signifi- 
cant persons in her life yields a superficially 
happy person who views the world in an ac- 
ceptable, if rigidly stereotyped, fashion. If 
the present combined interpretation approxi- 
mates the actual situation, then we feel 
compelled to predict another breakdown at 
some later period in this person’s life. In 
other words, the effect of therapy (whatever 
type it may have been) seems to have been 
further to strengthen the self-deceptive 
original organization of Jane, while making 
this personality even more rigid and insensi- 
tive to subtle differences in meaning and 
without resolving the underlying conflicts 
which created the original disturbance. 


What Price Therapy? 

It is impossible to tell from our semantic 
data whether the increasing simplification in 
structure characteristic of all three person- 
alities is due to therapy itself or is happening 
despite therapy. However, a number of spe- 
cialists in psychotherapy have from time to 
time expressed concern over the “hidden” 
effects of therapy even in so-called successful 
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cases—particularly reduction in _ initiative, 
creativeness, and flexibility of the patient. 
Certainly in the present case we are wit- 
nessing an over-all reduction in differentiation 
of meanings. If overt behavior is in consider- 
able part determined by meanings, as we be- 
lieve it is, then we must expect Jane (if she is 
the terminal personality) to be now even less 
capable of behaving differentially to her 
mother, father, spouse, and child—they are 
all essentially undifferentiated ‘‘strong-active- 
goodness” to her. This would also be true, 
but to a lesser extent, of Eve White, although 
here we would assume an earlier stage of 
therapy and hence a possibility of secondary 
elaboration of semantic diversity under sensi- 
tive therapy. 

Is rigidity of this sort a necessary price of 
therapy? In striving to achieve the goals of 
societal acceptability and invididual happi- 
ness, does the therapist have to sacrifice the 
richness, individuality, and subtler adjustive- 
ness of the patient? These are serious ques- 
tions raised by the data of this single case- 
but not answered by them, of course. From 
the larger sample of cases we are presently 
working on, better answers may be forth- 
coming, but the cases are generally less severe. 
One other interesting phenomenon in the 
present cases should be mentioned: Despite 
the gross changes in meaning of concepts in 
the several personalities, and the over-all 
reduction in diversity, the semantic judg- 
mental frame of reference remains constant. 
In other words, all three of the personalities 
in this woman utilize semantic scales in the 
same ways—the correlations between scales 
are the same for all three personalities and 
reduction in diversity in all of them is accom- 
plished by a coalescence of good, strong, and 
active into a single evaluative dimension. Thus 
it would appear that the level of scale mean- 
ings is below that at which concepts vary, and 
common to all three personalities. 
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CRITIQUE AND NOTES 


EFFECT OF INCREASING THE NUMBER OF RORSCHACH RESPONSES 
ON SUM C AND M: A NOTE ON FISKE AND BAUGHMAN’S STUDY 


CONAN KORNETSKY anp DONALD L. GERARD 
Laboratory for Socio Environmental Studies, Nctional Institute of Mental Health, Bethesda, Maryland 


ISKE AND BAUGHMAN in their recently pub- 

lished study inquired “... to what extent are 
the frequencies of responses in the several scoring 
categories related to the total number of re- 
sponses?” (2, p. 25). Their data indicated that 
there was a positive relationship between the num- 
ber of responses (R) and a number of clinically im- 
portant Rorschach determinants. They state, 
“While it is clear that frequencies in most categor- 
ies are related to the total number of responses, we 
cannot state, on the basis of the present data, 
which influences the other” (2, p. 30). This ques- 
tion which Fiske and Baughman ask may be par- 
tially answered as follows. If additional responses 
are elicited from a group of subjects (Ss) who 
spontaneously produced comparatively few re- 
sponses on the Rorschach so that their total num- 
ber of responses will equal those of a group who 
initially produced a larger number of responses, 
the influence of R on various scoring categories may 
be determined. Thus, it would be reasonable to 
expect that if the sum of the color responses (Sum 
C) and the human fantasy responses (M)! are a 
function of R, there would be no statistically sig- 
nificant difference between the amount of Sum C 
and M in these two groups when the number of 
responses are equated. 


METHOD 


Data which could be utilized for this purpose were 
obtained in a current study of teen-age opiate addicts. 
A procedure similar to that used by Jernigan (3) was 
employed. The Rorschach was first given in the usual 
manner. After this, but before the inquiry, the cards 
were again presented to the S with the instruction to 
tell again everything that he sees, but not to give any 
responses that he had given in the previous adminis- 
tration. 

Twenty-two Ss were studied in this manner. They 
were divided into two groups: Group A consisted of Ss 
who had 18 or more responses on theif first Rorschach 
administration; group B consisted of Ss who had less 
than 18 responses on their first Rorschach (mean: 
15.3, range: 13-16) but who were able to add suf- 
ficient new responses on their second Rorschach so that 





‘In the present study Sum C and M were selected 
because of: (a) their clinical importance, (6) their 
importance in a study now in preparation by the 
senior author, and (c) they appeared frequently 
enough in the small sample used so that statistical 
comparisons could be made. 


the total number of responses for Rorschach 1 and 2 
was over 18.2 For group B the determinants for 
Rorschach 1 and 2 were pocled, i.e., Sum C consisted 
of Sum C on Rorschach 1 plus Sum C on Rorschach 2. 
The distributions of the total number of responses for 
group A and group B were not significantly different 
(group A, mean: 25.0, range: 20-31; group B, mean 
23.4, range: 19-30). 

The Ss were dichotomized at the medians of the 
total distribution of Sum C (median: 1.0) and M 
(median: 1.3). Fisher’s “exact” test of the probability 
of obtaining a set of frequencies in a 2 X 2 table was 
used (1, p. 84) to evaluate the significance of the dif- 
ference between Ss above and below the dichotomy for 
the two groups. 


RESULTS 


The data for Sum C and M are summarized in 
Table 1 and 2 respectively. The probability that 
these obtained sets of frequencies would occur by 
chance are < .03 for Sum C and < .06 for M. 


TABLE 1 
Amount oF Sum C Dickotomizep at 1.0 ror Groups 
A AND B 
Group SumC>10 SumC < 1.0 
A 8 3 
B 1 6 
TABLE 2 
Amount OF M Dicuotomizep at 1.3 For Groups 
A ann B 
Group M > 13 M <1.3 
A 7 4 


B 1 6 


DISCUSSION 


These results indicate that equating the number 
of responses will not equate the frequency of color 
or human movement-determined responses. In 
terms of the problem raised by Fiske and Baugh- 
man, the results suggest that the frequency of color 
and human movement-determined responses is 


2 Four Ss were discarded because their total number 
of responses for Rorschach 1 and 2 was not greater 
than 18. 
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probably not due to the influence of the number of 
responses as an independent variable. 

The Ss who initially proffered the smaller num- 
ber of responses were a constricted nonproductive 
group. The failure of this group to increase the 
number of affective responses so that it would 
equal a more productive group when additional 
percepts were forced may be the consequence of 
their personality structure. Thus a study that used 
proportions in comparing the amount of affective 
responses between a constricted nonproductive 
group and a nonconstricted productive group may 
negate clinically meaningful differences. On the 
other hand, when nonconstricted productive indi- 
viduals add additional percepts, they may con- 
tinue to use affective responses in the same 
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proportion as they previously did, so that in a 


comparison between two productive groups the use 
of proportions may be justified. 
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ADEQUACY OF THE REPORTING-TEST MEASURES FOR DETECTING 
REACTIONS TO FRUSTRATION! 


W. BRUCE DOUGLASS 
University of California, Los Angeles 


AND KENNETH B. BROWN 


University of Missouri 


ECENTLY McKinney et al. (1) reported on a 
R paper and pencil test designed to measure 
changes in performance due to frustration in a 
group situation. This device, called the Reporting 
Test, consists of three similar series of diagrams 
having varying numbers of three types of symbols 
which the subject (5) is required to count. In the 
first series, Condition I (C I), S is allowed to work 
at his own pace for eight minutes. In Condition II 
(C ID), the frustrating situation, he is given an im- 
possible task of completing a designated number of 
diagrams every 30 seconds, for two eight-minute 
periods. To emphasize his failure, S is required to 
mark an “F”’ for every problem not completed in 
each 30-second period. The final condition (C ITI) 
is an eight-minute postfrustration period in which 
S again works at his own speed. 

Scores for each of the conditions on the Report- 
ing Test can be in terms of problems attempted, 
net correct, errors, and percentage correct. The 
differences in scores between the nonfrustrating 
condition, C I, and the frustrating condition, C IT, 
presumably are indicative of proneness to frustra- 
tion. McKinney ef al. (1) present evidence for a 
group of 196 Navy radar trainees that shows that 


‘This article is based upon the senior author’s 
master’s thesis (2) done under the junior author’s 
direction, and submitted to the department of psy- 
chology, University of Missouri. 


under C II as compared with C I Ss made signifi- 
cantly more errors and attempted more problems, 
but had fewer net correct and a smaller percentage 
correct on the average. ““However, not all individu- 
als reacted in the same way to experimental frus- 
tration: the majority speeded up performance at 
the expense of accuracy, but a smaller group 
showed stable performance without increase of 
speed or errors” (1, p. 322). Similar results were 
found when the test was repeated on 98 college 
students. 

While the investigators do not indicate what 
they consider the most appropriate method of 
scoring, they do state that the percentage correct 
score ‘‘. .. appears to be the most promising meas- 
ure of efficiency . . .”” (1, p. 320). 

The present study was designed to investigate 
the reliability and, indirectly, the validity of the 
four scoring methods for relatively unselected col- 
lege students. It is based on the assumption, sup- 
ported by the results cited, that the frustrating 
situation of C II should produce different changes 
in performance scores for different individuals from 
the scores earned under C I. That is, a frustrating 
situation would be expected to have more effect on 
some Ss than on others. For an adequate measure, 
one would then expect (a) high correlations among 
the various parts of the test when the conditions 
are fairly constant and (6) relatively low correla- 
tions when the frustrating condition is introduced. 
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The above reasoning provided a basis for a fairly 
simple method of investigating the four scoring tech- 
niques used on the Reporting Test. This was done by 
setting up a norfrustrated (NF) group, which was ad- 
ministered the three parts of the test without the 
frustrating condition. Another group of Ss, the frus- 
trated (F) group, was given the test according to the 
standard procedure. Following this, intercorrelations 
were obtained among the three parts of the test for 
each group on each measure (problems attempted, net 
correct, errors, and percentage correct). The NF-group 
correlations would provide measures of reliability. Then 
a comparison of the F-group correlations with the 
paired NF-group correlations would give an indication 
of which types of scores significantly reflected the dif- 
ferential changes that the frustrating condition was ex- 
pected to produce. 

The NF group consisted of 101 university students 
(74 males and 27 females) from two general psychology 
classes. The F group also had 101 Ss (65 males and 36 
females) from two other general psychology classes. 

The NF group was given the same instructiuns at 
the beginning as were the Ss of the F group. However, 
after the first eight-minute period (C I), NF-group Ss 
were stopped and told that E£ was interested in deter- 
mining the relative difficulty of the different parts of 
the test. They were then instructed to begin working 
on the C II portion of the test. This was done also for 
the second eight-minute section of C II and for C ITI. 


A total of 28 product-moment correlations was 
calculated for each group, and the significance of 
differences between paired F- and NF-group r’s 
was tested by the z transformation technique. 
However, only a summary of the results will be 
given here. 

All the obtained r’s for the two groups were sig- 
nificantly positive at beyond the .01 level of sig- 
nificance (r of .25 required for 99 df). The correla- 
tions for the NF group may be considered measures 
of reliability of the four types of scores used. How- 
ever, few of the r’s obtained for the NF group were 
high. For both error and percentage correct scores, 
none of the intercorrelations among various parts 
of the test was above .68. For problems attempted 
and net correct the r’s for C I and C II or C ITI 
faired no better (highest r was .61). However, wheu 
the two sections of C II (independently and com- 
bined) were correlated with C III, the r’s for both 
measures were appreciably higher, ranging between 
.78 and .87. 

These results for the NF group suggest that for 
all four measures scores obtained on the prefrustra- 
tion period may be too unreliable to be used in 
individual analysis. That is, Ss of the NF group did 
not reach fairly stable relative performance levels, 
as shown by problems attempted and net correct, 
until after the first eight-minute period. This fur- 
ther suggests that an unscored “warm-up” period 
would be beneficial, so that an individual’s per- 
formance under frustration could be compared 
with his stabilized performance without frustration. 


When the r’s for the F group were compared with 
the corresponding r’s for the NF group, few of the 
F-group r’s were found to be significantly lower. 
None would be expected to be significantly higher, 
and this was found to be the case. When the group 
differences in correlations between C I and C II 
were investigated, only those for the percentage 
correct scores were found to be significantly differ- 
ent in the expected direction (NF r = .66, Fr = 
.46; p = .05). However, this measure faired poorly 
in other comparisons (C II vs. C III and C I vs. 
C ITI) as indicated below. Since the results given 
previously suggest that it is relatively unreliable, 
the over-all results do not appear to indicate that 
the percentage correct score is a dependable 
measure of individual performance changes under 
frustration. 

When the differences between the paired F-group 
and NF-group correlations for the C II and C III 
periods were treated statistically, those for the F 
group were found to be significantly lower (p < 
.01) for both problems attempted and net correct 
scores. The respective r’s obtained are as follows: 
For problems attempted, NF r = .87 and Fr = 
.60; for net correct, NF r = .86 and Fr = .70. 
Further study is needed to clarify the significance 
of changes in performance from frustrating to post- 
frustration conditions. Casual inspection of the 
test data suggests that some Ss showed a carry- 
over of frustration reactions, others recovered 
quickly, others had little change from C I under 
either condition, while still others evidenced little 
change under C II but manifested it during the 
following period. While our results seem to indicate 
that dependable differential changes occur, their 
nature and meaning in terms of personality varia- 
bles need further investigation. 

Perhaps in part because of the low reliability of 
the error scores, no significant differences in the 
correlations of the F and NF groups were found on 
this measure. There were as many F-group r’s that 
were higher than were lower than the comparable 
NF-group values. These results, along with the pre- 
viously indicated low reliability of this measure, 
suggest that errors as well as percentage correct are 
not an adequate indicator of reactions to frustra- 
tion, at least as the test is presently designed. 

While there is considerable indication of face 
validity of the Reporting Test in observations of 
Ss taking it, in their introspective comments on it, 
as well as in the design of the frustrating situation 
itself, the present study suggests that refinement 
in scoring and construction would improve its use- 
fulness for determining frustration tolerance. 
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THE STABILITY OF AUTOKINETIC JUDGMENTS! 


J. H. ROHRER, S. H. BARON, E. L. HOFFMAN, anv D. V. SWANDER 


Urban Life Research Institute, Tulane Universily 


A MAJOR criterion used in a series of studies 
concerned with problems of personality 
structure and interpersonal interactions, carried 
out by the writers, has been rate of convergence 
to a common social norm established to perceived 
autokinetic light movement. Generalizations de- 
veloped from this experimenta! model of a social 
situation are limited by the stability, in time, of 
the norms so established. Sherif (8) in his classic 
experiment demonstrated that such norms were 
stable when tested after a period of one week. 
Bovard (2) reported stability of norms after 28 
days. This report deals with the stability of 
socially affected judgments, established to an 
autokinetic producing light S, after an elapsed time 
interval of one year. 


METHOD 


Subjects. The stability-of-norms test was carried out 
with 46 observers (Os) who had established social norms 
one year prior. All Os were undergraduate male students 
at Tulane and all were errolled in either Army, Navy, 
or Air Force ROTC training units. None of the Os was 
aware of the purpose of the study and none had expe- 
rienced the autokinetic-producing situation in the in- 
terval between test-retest. 

The experimental situation and room were identical 
for the norm formation and retesting sessions. The 
experimental situation has been described in detail else- 
where (4). The apparatus also has been described else- 
where (7). 

Experimental sessions. Three sessions were utilized: 
“T” an individual training session; ‘““X”’ a social norm 
formation session; and, “Y”’ an individual test of norm 
stability. The method of training utilized in session T 
has been described elsewhere (4). Briefly, Os were 





1 This study is one of a series carried out under ONR 
contract Nonr-475(01) with Tulane University. That 
contract provides for a systematic investigation of the 
relationships between personality, learning, and social 
variables. The project, under the direction of J. H. 
Rohrer, is a part of the research program of the Neuro- 
psychiatry Branch, Bureau of Medicine and Surgery, 
U.S. Departmerit of Navy. While this paper constitutes 
a technical report to ONR, the interpretations pre- 
sented herein are those of the authors and do not repre- 
sent, necessarily, those of the sponsoring agency, the 
Department of Navy. 


trained in individual sessions using actual! movement of 
a pinpoint of light within a circular luminous frame- 
work. The Os reported their judgments of the extent of 
movement after each exposure. Fifty exposures were 
made in this session. The actual distance that the light 
moved was determined by the experimenter (£) at the 
outset of the session and remained constant throughout 
the session, although the direction of the movement was 
systematically varied on a plane perpendicular to the O 
as permitted by the apparatus (7). 

About one hour following the training session, the 
norm formation session occurred. In this session the lu- 
minous framework was absent, the pinpoint of light re- 
mained stationary, and the auditory cues from the 
apparatus were present. In session X the Os were tested 
in pairs, one member had been trained in session T at a 
distance of 2 in., the other member had been tested at a 
distance of 8 in. During session X a verbal report of the 
perceived extent of light movement was obtained from 
the Os after each exposure. Fifty presentations were also 
made in this session. 

In session Y, which took place approximately 12 
months following session X, each O was tested individu- 
ally and judgments were given following each exposure. 
Fifty exposures were given in this session also. Thus, in 
session Y the conditions were identical with those used 
in session X, with the exception that each O was tested 
individually. 

Scoring. The scores were in terms of inches of move- 
ment reported by the O. The median of each O’s 50 
judgments, for each session, was computed. It is this 
median that constitutes the O’s norm or score for each 
session. References to ‘“‘mean score” indicate the mean 
of medians. 


RESULTS 


The data for the Os trained with a 2-in. light 
movement were analyzed separately from the data 
obtained from the 8-in. training group. This was 
done because the means of the T scores for the 
groups differ significantly—which was to be ex- 
pected—and the groups were heterogeneous with 
respect to variances. The results of an F test pro- 
vide evidence for rejecting the hypothesis of 
homogeneous variance at the .01 level of confi- 
dence. Furthermore, it was not feasible to adjust 
the autokinetic X scores to account for the differ- 
ences in T scores because an analysis of the re- 
gression data revealed that one regression line 
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could not be used for all observations. Although 
the slopes of the regression lines within the two 
groups were not significantly different and the 
regression line for the two means was linear, the 
regression coefficient for the data within the 
groups differed significantly from the regression 
coefficient for the group means. 

Table 1 presents the mean and variance for the 
T scores for Os trained at 2 in., and 8 in. respec- 
tively. When Os who had been trained with a light 
that moved 2 in. were paired with Os trained with 
a light that moved 8 in., and these pairs were 
tested together for their judgments of autokinetic 
movements, the judgments converged to produce 
the X means also presented in Table 1. The 
variances for these scores are also presented. It 
may be noted that the variability of the judgments 
in the autokinetic session is considerably larger 
than the judgments in the more structured training 
session. 

The group means for the autokinetic Y scores 
obtained one year after the X scores are also pre- 
sented in Table 1 together with their variances. 
A reduction in the group means may be noted for 
both T groups. This reduction is statistically 
significant (p < .01) for the 8-in. training group 
but not for the 2-in. training group. Differences 
between sessions have been evaluated and are 
presented in Table 2. 

In spite of the reduction in group means when 
retested after an interval of one year, there is still 
a high positive relationship between the norms 
established in sessions X and Y. This relationship 
is shown by the correlation coefficients in Table 3. 
The multiple-correlation coefficient, Ry.xr, is 
approximately the same as the simple coefficient 
of correlation between first and second autokinetic 
scores. 

In order to evaluate the relative contributions 
of X and T to the prediction of Y, the standard 
partial-regression coefficients were computed. The 
standard regression of Y on X, independent of T, 
is .739 and .600 for the 2-in. and 8-in. training 
groups respectively, while the standard regression 
of Y and T, independent of X, is .264 and —.008 
for the 2-in. and 8-in. groups respectively. It is 


TABLE 1 
MEANS AND VARIANCES 
(N = 23 in both training sessions) 





2-in. Training 8-in. Training 
Session ——— ian 
Mean Variance | Mean Variance 
T 3.08 6.08 8.74 .79 
xX 6.29 24.95 6.41 4.24 
Y 


5.50 12.54 5.09 5.06 


TABLE 2 
OBTAINED DIFFERENCES BETWEEN SESSIONS 
(N = 23 in both training sessions) 


2-in. Training 8-in. Training 








Statistic meetitaiatii = ee nee — 
7k 2 Tz X-Y 
Mean diff. | 0.79 2.33 1.32 


3.21 
t | 2.675 0.6076 | 4.8724 2.0301 
p /<.01 >.50 |<.01 <.05 








TABLE 3 
INTERCORRELATION COEFFICIENTS 
(N = 23 in both training sessions) 








Coefficient 2-in. Training | Sin. Training 
Txy .892 .595 
ryt | .693 341 
Txt 581 -582 
Ry. xr -918 -598 





evident from a comparison of these coefficients 
that the contribution of T to the prediction of Y 
is negligible. 


DISCUSSION 


In a previous study (4) it was demonstrated 
that the effects of training Os to judge the ac- 
tual distance moved by a light S, generalized to an 
autokinetic light movement situation and thus 
influenced the magnitude of the perceived auto- 
kinetic movement. Stated in more formalistic 
terms, the habit strength of an R tendency built 
up to an S of real movement, generalized to an S 
situation in which the perceived movement was 
illusionary. The data presented in this study 
demonstrate that judgmental norms of perceived 
autokinetic movement, modified in the direction of 
conformance with judgments of a second person 
in an interpersonal (social) situation, tend to negate 
the effects of the earlier individual training. More- 
over, this negation of training effects by social 
interaction persists in time, the effects of the 
social interaction manifesting itself in a ‘“non- 
social’”’ test situation after a period of one year. 
Again in more formalistic terms, the modification 
due to social interaction was in the intervening 
habit variable. 

Kelman (5) has demonstrated that felt “suc- 
cess” leads to stability of norms established to 
perceived autokinetic movement and felt “failure” 
leads to norm modification. The terms “‘success” 
and “failure” are most meaningful when used 
within a theoretical framework employing the 
concept of reinforcement. Such a framework re- 
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quires the identification of “motives” or “drives” 
that, when reduced, strengthen the tendency to 
respond. The change made in our S situation, 
which produced the altered behavior, was the 
introduction of a second O who began giving 
reports of perceived movement that were at 
variance with the judgments given by the other 
member of the pair. The altered behavior that 
resulted brought into conformance the judgments 
of perceived movement reported by both members 
of the pair. It is hypothesized that this shift was 
activated by an acquired motive for social con- 
formance.? The data argue for the shift being due 
to a drive variable, rather than due to a situa- 
tional, or S, variable because the altered R per- 
sisted when the interpersonal interaction was not 
operating (session Y). Thus a stable R tendency 
was established to the light S in the 50-trial X ses- 
sion by the reduction of the conformity drive, and 
this R tendency was manifested in the subsequent 
individual Y session. 

An attempt was made to explain the data by use 
of Bateson’s (1) and Harlow’s (3) notions of 
learning sets. The rather rapid rate of convergence 
noted in session X suggests that a set for con- 
formance produced by the interpersonal cues 
could be involved. What is known about human 
socialization practices also suggests that sets to 
conform are one of the most likely by-products of 
socialization. Yet the results obtained in session Y 
suggest that the notion of learning sets is not 
sufficient to explain the results; the specificity and 
stability of the acquired R tendency suggests the 
theoretical need for an additional notion similar 


2 Within the thecretical framework of Parsons ef al. 
(6), this would be equivalent to their “need-dispositions 
to conform to expectations” that was felt by “ego.” In 
the case of our Os, there was no evidence of overt con- 
flict involved in the conforming behavior. They were 
not aware that they were conforming, yet all converged 
very rapidly, despite the presence of acquired R tenden- 
cies, established in the T session, that would cause them 
to tend not to conform. 


597 


to what Harlow (3) has referred to as ‘“Thorndik- 
ian bondage” resulting from drive reduction. 


SUMMARY 


1. The stability of judgmental norms, estab- 
lished as a result of interpersonal interaction, was 
evaluated one year after they were established. 
The evaluation was made by testing the subjects 
individually. 

2. It was shown that the norms, so established, 
were stable. 

3. Data were presented which showed that 
social interaction resulted in a rapid modification 
of response tendencies established in a previous 
individual training session. 

4. The results obtained were interpreted within 
a reinforcement theory framework. 
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STUDIES IN THE SCOPE AND METHOD OF “THE 
AUTHORITARIAN PERSONALITY.” Edited by 
Richard Christie and Marie ‘ahoda. Glencoe, 
Illinois: The Free Press, 1954. Pp. 279. Price 
$4.50. 

This is the second in a series of which Studies in 
the Scope and Method of “The American Soldier” 
was the first. Three of the book’s six chapters are 
brief ones. Jahoda, in an editorial introduction, 
reconstructs some of the historical setting, both 
theoretical and methodological, in which the au- 
thors of The Authoritatian Personality began and 
continued their work. One of its unique features 
which she notes is its use of the methods of clinical 
psychology in conjunction with psychoanalytic 
theory “for other than clinical purposes, that is, 
for the exploration of political ideologies [as re- 
lated to] personality structure.” She also notes 
that none of the original authors was invited to 
reply to the criticisms (some of which, as will be 
shown, are rather harsh) in the volume under 
review here. 

The other two short chapters, by E. A. Shils 
and H. D. Lasswell, respectively, present an inter- 
esting contrast. The latter refers hardly at all to 
the volume by Adorno et al. but, like it, is prima- 
rily concerned with the scientific problem of the 
contribution of personality variables to behavior 
in the public arena—specifically, in their contribu- 
tion to political participation on the part of “the 
power-centered person.” Shils’s chapter, with its 
single-minded devotion to the charge that Adorno 
et al. were insufficiently aware of authoritarianism 
on the political “left,”’ keeps its eye on the authors 
but not on their scientific problem. Lasswell’s 
chapter, by contrast, is studded with relevant 
propositions, always interesting and generally 
testable. 

The final chapter, by Frenkel-Brunswik, is ap- 
propriately titled ‘Further Explorations...” 
Though it contains an occasional explanation or 
defense of something in the original volume, it is 
in no sense an apologia. The largest portion of it is 
devoted to her own post-1950 investigations of 
phenomena in children similar to those originally 
reported for older subjects. For example, prelimi- 
nary results from a set of experiments in perception 
“support the conjecture that, by and large, such 
tendencies as the quest for unqualified certainty, 
the rigid adherence to the given—be this an au- 
thority or a perceptual stimulus . . . tend to occur 
again and again within an individual in contexts 
seemingly far removed from each other.”’ 

In a final passage she raises the question whether 
or not the value biases of the authors of The 
Authoritarian Personality invalidated its conclu- 
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sions. Among several “arguments” offered in de- 
fense of a negative answer is this interesting one: 
“From this angle we must welcome the fact that 
E. R. Jaensch has given us articulate description of 
the ideal personality type from the standpoint of 
Nazi ideology. The studies of Jaensch, reported in 
some detail above...have thus certainly been 
conducted under a bias opposite to ours, yet they 
were found to lead to the recognition of similar 
patterns albeit with a different evaluative slant.”’ 

The meat of the book is to be found in two long 
and critical chapters of which the first, by H. H. 
Hyman and P.B. Sheatsley, is a methodological 
one. Their conclusions, in the main convincing to 
this reviewer, are worth quoting at some length: 


The findings rest on an admittedly unrepresentative 
sample, from which large generalizations are incau- 
tiously drawn . . ..Weaknesses in the authors’ handling 
of the quantitative data invariably work in favor of the 
assumptions, and the procedures are often inadmissible. 
... The questionnaire data are not validated save by 
their correspondence with the results of personal inter- 
views with extreme groups and...the criterion for 
validation was not independently derived ...no con- 
trol at all was exercised over the important variable of 
formal education ....Statistical tables are presented, 
the relationships are noted, and they are then “ex- 
plained” in terms of the authors’ theory. But there is no 
safeguard on the interpretations, and they become mere 
statements of opinion....Much of the qualitative 
analysis of respondents’ ideology is almost cornpletely 
unrestrained by any attachment to scientific method. 
The authors’ propensity for psychodynamic explana- 
tions leads them to ignore the widespread distribution 
of certain sentiments in the American population... 
and to attribute them instead to psychodynamic 
processes unique in the ethnocentric individual. 


These harsh judgments of the methodological 
trees do not, however, prevent the critics from 
taking an appreciative view of the research forest: 
“One feels intuitively that (the authors] have 
wisdom in their views and soundness in their con- 
clusions, and it is sad that the acumen which in- 
spired the project, the energy in executing a 
research task unparalleled in scope, and the intui- 
tive power devoted to the appraisal of the results, 
were not matched by equal methodological skill.” 

In the other long chapter, ‘‘Authoritarianism 
Re-examined,” R. Christie tackles five problems 
raised by the research reported in the original vol- 
ume or by later researches inspired thereby. (a) 
The F-scale items “did not appear to capture 
authoritarianism per se... the postulated per- 
sonality variables underlying their acceptance had 
tenuous but provocative support.” (b) The F scale 
“did discriminate between individuals who dis- 
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played behavior which was authoritarian in in- 
terpersonal relations, rather than politically”’; its 
discriminating power is ‘‘consistent” but not ‘‘ex- 
ceptionally high.” (c) The authors’ contention that 
ethnocentrism is rather highly generalized is pretty 
well supported, and its relationship to “certain 
aspects of personality appears to be well estab- 
lished”’; e.g., “those persons who view the world as 
threatening and unpredictable tend to be less 
tolerant of minority groups.”’ (d) Various measures 
related to socioeconomic status show “a sizeable 
negative correlation” with F-scale scores; “these 
findings are interpreted as running counter to the 
general argument in The Authoritatian Personality 
which emphasized purely personality determinants 
...and discounted contemporary social influ- 
ences.” (e) With regard to “the hypothesized 
greater importance of early environment in con- 
trast to adult experiences in the genesis of poten- 
tially fascistic personality ... neither the original 
data nor subsequent findings are sufficient to prove 
or disprove the hypothesis.” 

Were it the present reviewer’s function to offer 
a summary evaluation of The Authoritarian Per- 
sonality, he could hardly do it more sagely than has 
Christie: “Both [its] strength and weakness lie in 
its basic assumptions which are rooted in psycho- 
analytic theory. Such an orientation has led to the 
uncovering of a host of data which in all likelihood 
would not have been discovered by investigators 
with differing theoretical viewpoints. Despite some 
methodological weaknesses in the original research, 
subsequent findings have been predominantly 
confirmatory.” 

It is this reviewer’s function, however, to evalu- 
ate this volume of evaluations. Was it worth while 
to publish what might be termed a 275-page book 
review? The present verdict is a thrice-qualified 
affirmative. Qualification 1: Some chapters are 
carelessly edited and/or proofread. Qualification 
2: All but the two long chapters might have found 
other publication outlets. Qualification 3: Only a 
work as important as The Authoritarian Person- 
ality could justify such an outsize review. In sum- 
mary, the justification of the book under review 
lies in the importance of the book which i reviews, 
together with the carefully documented criticism 
of its two major chapters. Readers who enjoy fair, 
astute, detailed, and documented criticism of im- 
portant psychological contributions will wish that 
there were more frequert opportunities for cri- 
tiques like these. 

THEODORE M. NEWCcOMB 

University of Michigan. 


THe STRUCTURE OF PERSONALITY. By H. J. 
Eysenck. London: Methuen & Co. and New 
York: Wiley, 1953. Pp. 348. Price $5.75. 

Some fifty years of endeavor on the part of 
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several hundred psychologists are dissected and 
reported upon in this book. The literature was 
combed for empirical studies bearing on theories of 
personality structure, the proper statistical model 
for which, according to Dr. Eysenck, is R factor 
analysis, and more especially analysis into first- 
and second-order factors along Thurstone’s well- 
known lines. The theory favored by Eysenck is 
a hierarchical construction of his own, according 
to which habitual modes of response become or- 
ganized into traits, which in turn are arranged into 
a few all-embracing types. Traits and types cor- 
respond to first- and second-order factors, respec- 
tively. Traits are of the kind “shyness,” “‘per- 
sistence,”’ “rigidity”; types are groups of correlated 
traits, there being three such clusters, called intro- 
version-extroversion, neuroticism, and psychoti- 
cism. The book is a disquisition on facts to 
support this hierarchical construction, drawn vicari- 
ously from studies on the analysis of question- 
naires, objective behavior physique, 
physiological measures, interests and attitudes, 
correlation between persons, and “‘trait’’ measure- 
ments. 

Such a construction lends itself very well to 
Eysenck’s stature in this field, and we may be sure 
that some penetrating analyses are undertaken 
in these pages, many sound conclusions as to fact 
drawn, and many interesting lines pursued. 
Eysenck takes a firm grip on his comb, pulling 
it where it will. Psychologists who have to deal 
with the practical problems of society, with the 
gross diagnosis of mental disorders, or with the 
selection of personnel in group testing situations 
and the like, will find a great deal indeed to please 
them in these chapters. The present reviewer does 
not feel competent to assay some of these facts, 
but where he is able to judge them in detail, as, 
for example, in the treatment of important studies 
by Jenkins and his collaborators (3), he is of the 
opinion that Eysenck’s judgment is sound and 
constructive. The types themselves are perhaps 
badly named—introversion-extroversion in Ey- 
senck’s system is scarcely what Jung had in mind, 
and neuroticism ordinarily connotes a more specific 
set of conditions than Eysenck’s second-order 
factor embraces. The chapter on correlation of 
persons is no worse for its inadequacies than others 
written by those who adopt the primacy of R 
methodology. One ‘is sure, with such exceptions, 
that Eysenck has dealt fairly with facts, and that 
there is much substance to his construction. 

Even so, a construction of this kind is not a 
theory in a genuine sense, and Eysenck’s comb 
seems to have many broken teeth, so that much is 
left unteased on Psyche’s shaggy brow. To take 
the lesser matter first; one may wonder whether 
or not important papers in the area, which happen 
to be of a theoretical nature, have been unwisely 
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passed over on the mere ground that they are not 
factual. Thus, it may be argued that a theoreti- 
cal paper that relates perseveration to Pavlov’s 
theories of inhibition (4), that is, substantially to 
conditions of true experimental neurosis, is worth 
more attention than most so-called factual work 
on the subject of perseveration. It is clear, also, 
that the armory of factor analysis does not end 
with first- and second-order factors, and that third 
and higher orders can be expected, or have been 
unearthed: what “theory,” one wonders, will 
Eysenck offer for these? Since increase of com- 
plexity is embraced in the hierarchy, no doubt 
archetypes will correspond to third-order factors. 
Or, a third-order bipolar may perhaps correspond 
to the sexes, and fourth orders to the different 
races of mankind? These questions, it seems to this 
reviewer, lay bare the weakness of Eysenck’s 
construction: clearly it is not a theory in any hypo- 
thetico-deductive sense, as Albino (1) has already 
indicated. Eysenck’s construction is purely ad 
hoc, explaining nothing, and permitting of no 
pertinent hypotheses or deductions. 

Here, it seems, is the essential difficulty. Nothing 
genuinely theoretical about personality, or even 
about scientific method, is to be found anywhere 
in these pages. This is the more astonishing be- 
cause Eysenck professes above all else to be 
thoroughly scientific, and to be devoted to the 
hypothetico-deductive method. A_ theoretician, 
however, would have felt it essential to offer a 
theory for the types outlined in this book: several 
are possible, one, in particular, in relation to ego 
theory and defenses. Eysenck offers no such theory, 
of course, and would regard all of them, inevitably, 
as so much speculation. This is because as a 
factorist, he has grasped, above all else, the im- 
portance of operations in scientific work: for 
Eysenck a theory without actual tests and proofs 
is anathema. Most theories in personality study, 
Eysenck correctly sees, are without the essential 
operations. It may be suggested, however, that 
not all such theories are purely speculative. A 
little perspective might lead one to conclude that 
many are perhaps on the right lines, and that the 
factorist would do a really important job if he 
could provide the missing operations. The factor 
methods that Eysenck espouses so cleverly can 
in fact be placed at the service of the very theories 
that he rejects. Thus one must agree with Eysenck 
that matters of scientific proof are essential; one 
must agree, too, that factor methods can provide 
really cogent operations; but it is as well to re- 
member that operations without a genuine theory 
in attendance are apt to become the worst kind of 
scientism (2), and it is from such a charge that the 
present reviewer would like to save Eysenck. 

However, it will avail little to discuss this 
methodological difficulty. What is required from 


those of us who are factorists is considerable 
tolerance for all who have theoretical interests in 
personality studies. The necessary operations are 
not far away. And about the merits of our own 
efforts we might profit by remembering Queen 
Elizabeth I’s outburst at Cambridge University 
nearly four centuries ago when her incomparable 
virtues were being extolled by the dons of her day: 
“Non est veritas,” she blurted, “‘et utinam... .” 
So, too, for Eysenck and factor analysis. The time 
has surely come for a great deal of circumspection 
from all who are trying so hard to comb Psyche’s 
hair. Otherwise we shall never find out whether, 
after all, she only wears a wig. 
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THE MAKING OF A SCIENTIST. By Anne Roe. New 
York: Dodd, Mead, 1953. Pp. ix + 244. 
Price $3.75. 

This book by a psychologist is addressed pri- 
marily to nonpsychologists. Since the researches 
reported here are already available to psychologists 
in their professional journals, the interest of the 
volume for them will lie largely in the fact that 
it is an example of popular writing. The planning, 
the execution, the results, and the implications of 
the research are all here. 

Most readers of this journal, doubtless, have 
a speaking acquaintance with Dr. Roe’s project. 
She spent nearly four years making “clinical 
studies of vocational choice and performance in 
terms of life history and character structure.” 
Sixty-four distinguished American-born research 
scientists in the age range 30 to 60 were her sub- 
jects. They were drawn in about equal numbers 
from the ranks of biologists, physicists, and social 
scientists (psychologists and anthropologists). A 
control group of competent but less distinguished 
scientists in the same fields was also studied by 
group methods. A control group of similarly dis- 
tinguished nonscientists was not used. The results 
indicate that there are patterns in the life histories, 
intellectual abilities, and personality structure 
that are “more characteristic of scientists than 
they are of people at large, and some which are 
more characteristic of special kinds of scientists 
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than of other scientists.’’ No differences were found 
between the more eminent and the less eminent 
physicists, but in the other two groups the more 
eminent ‘“‘tended to have extremer degrees of the 
factors that characterized the men in the field.” 

The writing is clear and animated. The exposi- 
tory sections, especially those dealing with method- 
ological problems and technical information, are 
models of simple, economical writing. The human 
interest value of the subject investigated is aug- 
mented by the chatty, autobiographical cast of 
the presentation itself, as well as by the inclusion 
of many personal experiences that were incidental 
to the conduct of the research. This added human 
interest, however, is had at a price: often the 
author, as a person, stands in front of the subject 
matter of the book, and thus distractingly com- 
petes for the reader’s attention. 

Dr. Roe tells us the several purposes she had 
in writing this volume. They were (a) to “enable 
me to review the experience as a whole,” (b) “‘to 
give my subjects a more coherent and compre- 
hensive account than the technical reports can 
give anyone but an expert in my field,” (c) to 
“make it possible for others to share in a pro- 
foundly moving experience, and (d) toexamine with 
me the meaning and value of science for the man 
who does it and for society” (p. 17). It is doubtful 
that writing this sort of book is essential to achiev- 
ing the first purpose, and it is not likely that the 
author seriously believed it to be so. As to the 
second purpose, fully one fourth of the content of 
the book is not essential to achieving it; and much 
of this fourth, the reviewer believes, would annoy 
her distinguished subjects by its irrelevance. The 
third purpose, that of sharing a profoundly moving 
experience, is realized with a high degree of suc- 
cess. The fourth is really two purposes. One of 
these, the examination of the meaning of science 
for the man who does it, is achieved with distinc- 
tion. The data are carefully evaluated, and the 
generalizations are circumspect. The other, the 
examination of the value and the meaning of 
science for society, is not attained. Here the re- 
sponsible scientist yields to the beliefs and hopes 
of the socially conscious citizen-scientist. Properly 
speaking, Dr. Roe does not examine the meaning 
and value of science for society. Rather she testi- 
fies to a religious faith: “. . . the things that make 
good scientists are also the things that make good 
citizens” (p. 240). What is good for General 
Motors is good for the nation! According to the 
findings of her study, the things that make good 
scientists are experiences in the course of develop- 
ment which foster and make rewarding (a) in- 
dependence in individual thinking, and (6) in- 
dependent empirical investigation as a means of 
answering questions. Her data, however, do not 
indicate clearly or reliably that these two abilities 
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and dispositions, which she found were charac- 
teristic of her subjects in the fields in which they 
were distinguished, are characteristic of them in 
other fields, nor do her data bear on the question 
of whether or not these characteristics are sufficient 
to make good citizens. 

With her faith we cannot quarrel; indeed, many 
of us subscribe to it. But let us not embrace science 
and condemn dogma while at the same time we 
promulgate dogma in the name of science. It was 
not a scientist nor a psychoanalyst but a jurist, 
Mr. Justice Holmes, who advanced the opinion 
that the great difficulty with all thought is the 
inarticulate major premise. 


Harvard University STANLEY G. EsTEs 

RACIAL AND CULTURAL MINORITIES: AN ANALYSIS 
OF PREJUDICE AND DISCRIMINATION. By 
George Eaton Simpson and J. Milton Yinger. 
New York: Harper and Brothers, 1953. Pp. 
ix + 773. Price $6.00. 

In a few prefatory remarks the authors tell us 
why they wrote the book: the prejudice-discrimi- 
nation melon, they say in effect, has been sliced a 
great many times and from a great many angles. 
This is a good thing. But the very proliferation of 
specialized studies may obscure the important 
truth that the melon is more than the sum of its 
slices; certainly it makes the task of synthesis 
increasingly difficuit. Accordingly, it is time to re- 
examine the whole in the light of our accumulated 
knowledge about its component wedges. In the 
course of this re-examination, or in consequence of 
it, the authors hope to furnish the reader with 
guiding concepts which will make him “. . . less 
likely to adopt an oversimplified, one-factor view, 
and less likely to study one situation in isolation 
from the total society of which it is a part.” They 
propose to do this in two ways. First, by “. . . re- 
lating the various analyses to each other and to a 
systematic group of principles that underlie them 
all,” and secondly, by writing from the point of 
view that “. . .every proposition concerning inter- 
group relations should be . . . a part of the general 
principles being developed in social science.’’ As it 
turns out (perhaps atypically) these objectives 
have a subtle but profound influence on the sub- 
stance they preface. 

There are three major sections. Part I, The 
Causes and Consequences of Prejudice and Dis- 
crimination (ten chapters, 315 pages), includes 
analyses of various types of majority-minority 
situations and of a number of “approaches’”’ to the 
concept of race, a discussion of the personality and 
group-conflict functions of prejudice and of as- 
sorted consequences of prejudice and of discrimi- 
nation, and a two-chapter “case study” of the 
sociology and social psychology of anti-Semitism. 
Part II, Minorities in the Social Structure: The 
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Institutional Patterns of Intergroup Relations (ten 
chapters, 324 pages), provides a one-chapter review 
of social class and caste notions and presents data 
on economic, political, legal, familial, religious, 
educational, and artistic characteristics of promi- 
nent minority groups in the United States. Part 
III, Prejudice Discrimination and Democratic 
Values (two chapters, 71 pages), comprises a pro- 
gram of strategies with which to reduce prejudice 
and discrimination by, respectively, changing the 
prejudiced person and changing situations condu- 
cive to prejudice. Remaining materials include 
a tabular appendix on the size of various minority 
groups in the United States, Brazil, and the Union 
of South Africa, a bibliography of 498 entries, 
and separate subject and name indexes. 

In essence the book is a comprehensive review 
of historical and sociological fact and fancy regard- 
ing prejudice and discrimination and, as such, is 
potentially valuable for reference purposes. The 
authors’ knowledge of their subject is impressive, 
their approach scholarly, even polymathic, and 
their prose learned but easy. For psychologists the 
second section particularly is likely to prove useful. 
It includes a wealth of factual material, admirably 
organized about specific behavioral areas, which is 
not otherwise available in a single work. For ex- 
ample, the four chapters in this section which are 
devoted to minorities and the American political 
and legal processes and to the education of minori- 
ties in the United States provide both a fascinating 
and a compelling setting for the recent Supreme 
Court decision on segregated schools and a remark- 
able demonstration of the institutionalization of 
attitudes toward minority groups. Unfortunately, 
the book’s usefulness is impaired, per>aps vitiated, 
by several pervasive inadequacies. L.. the opinion 
of this reviewer, all are attributable to the familiar 
holistic bias implicit in the intent to “integrate” 
and to present a “total context.’’ Two are illus- 
trative. 

First, the attempt to provide what one must 
suppose is a “multifactored”’ view and to consider 
the concepts and methods of a variety of disci- 
plines leads to the inclusion of material which is of 
questionable relevance and/or dubious validity, 
and results in a diffuse and uneven treatment. 
What purports to be a critical evaluation turns out 
to be an uncritical admixture: demographic fact 
(Census data on the distribution of farms of non- 
white operators) and secondhand anecdote (“a 
Negro actress who travelled . . . through the South 
reported ...’’); hardheaded appraisal (“We have 
a measure of only the changes in responses to verbal 
tests, not of other behavioral changes’), and facile 
speculation (‘for the minority artist, the creative 
process provides emotional release ...{and]... 
social recognition’’); historical erudition (“there 
were three major dispersals of Jews during an- 
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tiquity”), and psychological naiveté (“... preju- 
dice ,.. seized upon by an insecure person may 
result in . . . a rigidity of mind and compulsiveness 
in adjustment”); devotion to science (‘to fail to 
take into account methodological problems is to 
give an impression of finality that no science ... 
can accept’) and methodological confusion (“In 
defining relations ...with so many variables in- 
volved, one is bound to run into disagreement over 
what is more and what is less’) are inextricably 
compounded. And while it may be true, for exam- 
ple, that “Reproduced, spurious, and bastard 
dialects tend to reinforce unfavorable racial stereo- 
types, [while] dialect poetry of the Sterling Brown 
type provides valuable insights into folk Negro 
life” (the reviewer and two colleagues were unable, 
in three tries, to separate the bastard and legitimate 
dialects), it is difficult to tell just where this obser- 
vation fits. The task of sorting out the tangential 
and the suppositious falls to the reader. 

Secondly, since the “general principles” which 
purportedly underlie the varied phenomena of 
minority-majority relations simply do not exist— 
the authors do not, at any rate, get around to 
stating them—the desired “synthesis” is accom- 
plished by a kind of verbal stickum. Sometimes the 
stickum is a persuasive quotation from a gifted 
writer, for example, Lillian Smith or Richard 
Wright, and sometimes it is an equally persuasive 
passage by the gifted authors. In both cases, plau- 
sibility and fluency are apparently accepted as 
criteria of truth. In a discussion of the use of anti- 
prejudice propaganda, for example, the reader is 
admonished to heed the interpretive limits imposed 
by the available experimental evidence. These 
limits are restrictive; the experimental findings 
cannot easily be extrapolated to the pressing social 
problem of reducing prejudice. Accordingly, the 
discussion concludes with the quotation of a num- 
ber of quite reasonable “principles that help one to 
understand the effectiveness of anti-prejudice 
propaganda.” These principles have wide applica- 
bility and are wholly uncorroborated. 

It is difficult to strike a balance between the 
book’s principal asset—its truly exhaustive cov- 
erage of sociological and historical data and opin- 
ion—and its principal liability—its lack of selec- 
tivity. The answer undoubtedly depends in part 
on the level of sophistication of its potential read- 
ers. The reviewer is somewhat hesitant about 
recommending its use as a text, for example, for 
fear that undergraduates will be too easily taken 
in by its facile summaries and pseudosystematic 
bias. Those better qualified to pick its meat, how- 
ever, will probably find it a useful refresher and 
reference source. 

Joserpu G. SMITH 

Commission on Human Resources 

and Advanced Training 
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SoctaL PsycHOLOGY AND INDIVIDUAL VALUES. 
By D. W. Harding. New York and London: 
Longmans, Green (Hutchinson’s University 
Library), 1953. Pp. vii + 184. Price $2.40 
(Text $1.80). 

This book is concerned with what seems to be 
the central theme of social psychology, viz., the 
behavior of the individual in the group, or, as its 
author explicitly states it, with “the development 
of the social individual.” 

While he is fully aware of the opposing belief, 
Harding begins his study of the social individual 
with the widely held biological view that all ani- 
mals, including man, are inherently social. Man’s 
sociability is not, however, an instinct. Harding 
rejects the “herd instinct’’ of Trotter, the “gregari- 
ous propensity” of McDougall, and other similar 
modes of explanation. The error in the older views 
of the social nature of man, according to Harding, 
was their attribution of specificity to human socia- 
bility. What man wants most of the time, Harding 
believes, is “‘an awareness of friendly companion- 
ship in other activities, not specifically social ones” 
(p. 15), a knowledge that his activities are taking 
place in a congenial social context which stimulates 
a sense of being reciprocally sanctioned. 

Although he prefers the biological explanation 
of an innate social disposition, the argument and 
the evidence which Harding submits are not cru- 
cial. As a matter of fact they might be used to sup- 
port the contrary position equally well. To say, 
moreover, that this innate disposition requires a 
social context for its exercise can only mean opera- 
tionally that ic is an acquired habit in the first 
place. In the face of his own analysis the author 
would have been on firmer ground had he left the 
question of priority an open one. 

Any attempt to account for the natural socia- 
bility of man inevitably runs into a collision with 
the stubborn fact of human pugnacity and hos- 
tility. Harding is as conscious of this problem as 
anyone, but his “solution” of it is unconvincing. 
In “integrative” behavior, he holds, conflict of 
aim is eliminated through the realization by the 
disputants that each looks upon the conflict from 
an exclusive perspective. By means of a special 
kind of cooperation, characterized by mutual in- 
fluence in the use of means to the achievement of 
an end, they resolve their common difficulty. This 
is the wish, no doubt, of all reasonable men; but 
the realities of human conflict force one to describe 
its consummation as unduly optimistic. 

Although man is naturally a social animal, his 
sociability, it has been said, is developed only in a 
social context. An explanation of his sociability is 
found in his early social development. The process 
of socialization, while long and varied, is summed 
up in the infant’s attachment to his mother, who 
ministers to his needs and provides him with his 
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basic satisfactions. Out of this attachment to the 
mother the infant develops a rudimentary senti- 
ment, and, as his circle of interactions widens, he 
forms sentiments with respect to other individuals 
and their behavior, the final product of which is 
his own personality. 

Unlike many psychologists, Harding is not 
afraid to tackle the problem of the moral develop- 
ment of man. In a chapter on “The Individual’s 
Morality” he deals with moral standards without 
moralizing. Being social beings, men place a high 
value on consideration for others. Whence comes 
this need to do right even when detection of doing 
wrong is unlikely? There are two reasons for this, 
one individual, the other social. ‘“‘A breach of the 
moral code that we share with other people,” 
Harding writes, “automatically deprives us of our 
sense of having group support in what we do: we 
know that our group now sanctions us not for 
what we are but for what we pretend to be”’ (p. 
54). 

The individual explanation is based on the 
holistic character of man. This consists in “the 
central effort of a living being to remain an inte- 
grated whole and avoid tensions that result from 
actions conflicting with the stable features of his 
personality, such as his sentiments and the estab- 
lished hierarchy of his values” (p. 54). Coherent 
moral behavior is not derived from membership 
in a social group but is a characteristic of man’s 
biological make-up. 

In a period of the development of social psy- 
chology when the Group, like the King of old, can 
do no wrong, it is gratifying to find in this book an 
examination of the group’s fitness for its members. 
Adjustment and conformity are here put in a bal- 
anced perspective. An important psychological 
problem is to examine the extent to which the 
group permits and to what extent it stifles the 
individual’s satisfaction of private needs and de- 
sires and his ability to adhere to values that other 
members of the group will not support. A group 
whose culture fails to bring out an individual’s 
possibilities, or one which, through rejection or iso- 
lation, handicaps its gifted members, is inadequate 
to their needs. In this connection the author rightly 
criticizes the trend of much social psychology and 
psychotherapy toward a passive and uncritical 
acceptance of the standards of the dominant social 
group. 

In the chapter on “The Social Meaning of Nor- 
mality”’ we find familiar material, but it bears 
repetition. Like most psychologists, Harding re- 
spects the statistical conception of normality, but 
he is not beguiled by its quantitative formulation. 
Neither does he go the whole way with those cul- 
tural relativists who would define normality and 
abnormality wholly in terms of cultural norms. A 
view that will do justice to the richness and com- 
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plexity of human behavior, he shows, must also 
consider the degree of integration or wholeness of 
the individual. Much ill health, he says, expresses 
“the mind’s failure to achieve or maintain integra- 
tion”; and an individual is “maladjusted” to the 
extent that his mind is “divided against itself and 
failing to maiatain its wholeness” (p. 118). 

There are other important topics in this book 
that deserve consideration but which must be 
omitted in this brief review. Harding’s analysis of 
leadership, a topic yet to be adequately treated by 
social psychologists, deserves the reader’s careful 
attention. The term “leader,” he suggests, “might 
profitably be kept for one who guides and coordi- 
nates the efforts of a number of people (or at least 
one other person) while they perform some activ- 
ity” (p. 107). A chapter each on “Innovators” and 


“The Average and the Excellent” round out the 
book and suggest a measure of the sensitivity of 
the author to human problems. 

The American reader of this British book who 
gauges its adequacy by textbook standards will miss 
a number of conventional topics. On occasion he 
may even become impatient with some of its philo- 
sophical overtones. These are on close examination, 
however, the astute observations by a sensitive 
mind of some of the broad implications of empirical 
research in social psychology. While the author is 
bound by the facts, he is not their slave. The result 
is a fine combination of scientific truth, practical 
wisdom, and literary excellence. 

HuBERT BONNER 

Columbia University. 
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